GENERALIZED LIMITS IN GENERAL ANALYSIS*
SECOND PAPER 


BY 
CHARLES N. MOORE 


In a previous paper of the same title† I have developed the fundamental
principles of a general theory which includes as particular instances the
theories of Cesàro and Hölder summability of divergent series and divergent
integrals. I further made use of these fundamental principles to prove a general
theorem which includes as special cases several important theorems in the
above-mentioned special theories.

In the present paper the general theory referred to above is extended to
the case of multiple limits and the theorem mentioned is likewise generalized.
The theorem thus obtained includes as special cases the extension to multiple
series of the Knopp-Schnee-Ford theorem‡ on the equivalence of Cesàro and
Hölder summability for divergent series, the extension to multiple integrals of
the analogous theorem of Landau‡ for the case of divergent integrals, and the
extension to partial derivatives of a corresponding theorem with regard to the
equivalence of certain generalized derivatives. Once the principles of the
theory are set forth, the proof of this general theorem is fully as simple as
the proofs of any of the special theorems would be. Thus we have exhibited
the greater power of the methods of General Analysis as compared with the
methods of classical analysis.
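The single-series case of the equivalence can be checked numerically. The following is a minimal sketch, not taken from the paper: it assumes the classical definitions of the (C, k) and (H, k) means of a simple series and applies both to 1 - 2 + 3 - 4 + ..., whose means of order 2 tend to 1/4 under either method. All function names are illustrative.

```python
from math import comb

def partial_sums(values):
    """Running totals of a finite list of numbers."""
    out, total = [], 0.0
    for x in values:
        total += x
        out.append(total)
    return out

def holder_means(terms, k):
    """(H, k) means: take arithmetic means of the partial sums, k times over."""
    seq = partial_sums(terms)
    for _ in range(k):
        seq = [s / (n + 1) for n, s in enumerate(partial_sums(seq))]
    return seq

def cesaro_means(terms, k):
    """(C, k) means: sum the partial sums k times, then divide by binomial(n + k, k)."""
    seq = partial_sums(terms)
    for _ in range(k):
        seq = partial_sums(seq)
    return [s / comb(n + k, k) for n, s in enumerate(seq)]

# Terms of the divergent series 1 - 2 + 3 - 4 + ...
terms = [(-1) ** n * (n + 1) for n in range(2000)]
c2 = cesaro_means(terms, 2)
h2 = holder_means(terms, 2)
print(c2[-1], h2[-1])  # both values lie close to 0.25
```

The two sequences of means differ term by term, yet they approach the same value; the theorem of this paper asserts the corresponding agreement for multiple limits.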

The basis of our general theory may be indicated as follows: 


9; on Ss. ..-. (i 1 : 
Si, Ss, --- (i = 1, 


§ S1, Sa, Sy, to S;,Ss, .... Sn to 


to H,; on H, to = 1. 


on@ toH; on H tos) | 


* Presented to the Society April 14, 1922.
† These Transactions, vol. 24 (1922), pp. 79-88.
‡ For references to the literature dealing with the special theorems referred to, see Paper I.


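where $\mathfrak{A} = [a]$ denotes the class of all real numbers $a$, $\mathfrak{P}_i = [p^{(i)}]$ denotes
a class of elements $p^{(i)}$ $(i = 1, 2, \dots, m)$, and $\mathfrak{S}_i = [\sigma^{(i)}]$ denotes a class of
sets $\sigma^{(i)}$ of elements $p^{(i)}$ of the range $\mathfrak{P}_i$ $(i = 1, 2, \dots, m)$; $\mathfrak{G} = [\gamma]$, $\mathfrak{H}_i = [\eta^{(i)}]$
$(i = 1, 2, \dots, m)$, $\mathfrak{F}_i = [\varphi^{(i)}]$ $(i = 1, 2, \dots, m)$, $\mathfrak{H} = [\eta]$, and $\mathfrak{F} = [\varphi]$ are
$(2m + 3)$ classes of functions, $\gamma, \eta^{(1)}, \dots, \eta^{(m)}, \varphi^{(1)}, \dots, \varphi^{(m)}, \eta$, and $\varphi$ respectively,
on $\mathfrak{S}_1, \mathfrak{S}_2, \dots, \mathfrak{S}_m$ to $\mathfrak{A}$ (we consider only single-valued functions);
$\varphi_0^{(m)}$ is a special function of the class $\mathfrak{F}$; $J_i$ is a functional operation turning
a function of the class $\mathfrak{G}$ into a function of the class $\mathfrak{H}_i$ or a function of the
class $\mathfrak{H}_i$ into a function of the class $\mathfrak{F}_i$, denoted by $J_i\gamma$ or $J_i\eta$ respectively;
and $J$ is a functional operation turning a function of the class $\mathfrak{G}$ into a function
of the class $\mathfrak{H}$ or a function of the class $\mathfrak{H}$ into a function of the class $\mathfrak{F}$,
denoted by $J\gamma$ or $J\eta$ respectively.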

In order to show the relationship of our general theorem to the special 
cases of it to which we have referred, we will indicate here what the general 
basis reduces to in the particular instances III and IV. 


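III.

$\mathfrak{P}_i$ = [all $n_i = 1, 2, 3, \dots$],
$\mathfrak{S}_i$ = [$\sigma^{(i)}_{n_i} = (1, 2, \dots, n_i)$]  $(i = 1, 2, \dots, m)$;
$\varphi_0^{(m)}(\sigma_{n_1}, \dots, \sigma_{n_m}) = n_1 n_2 \cdots n_m$;
$(J_i\theta)(\sigma_{n_1}, \dots, \sigma_{n_m}) = \sum_{k_i=1}^{n_i} \theta(\sigma_{n_1}, \dots, \sigma_{k_i}, \dots, \sigma_{n_m})$,
$(J\theta)(\sigma_{n_1}, \dots, \sigma_{n_m}) = \sum_{k_1=1}^{n_1} \cdots \sum_{k_m=1}^{n_m} \theta(\sigma_{k_1}, \dots, \sigma_{k_m})$
$(i = 1, 2, \dots, m;\ \theta = \gamma, \eta)$.


IV.

$\mathfrak{P}_i$ = [all $x_i > 0$],
$\mathfrak{S}_i$ = [$\sigma^{(i)}_{a_i}$ = (all $x_i$ such that $0 < x_i < a_i$)]  $(a_i > 0;\ i = 1, 2, \dots, m)$;
$\mathfrak{G}$ = [all functions that are finite in any finite region $(0 < x_i < a_i;\ i = 1, 2, \dots, m)$ and are integrable (Lebesgue) with respect
to each of the variables $x_i$ $(i = 1, \dots, m)$ on every finite interval $(0 < x_i < a_i;\ i = 1, \dots, m)$];
$\varphi_0^{(m)}(\sigma_{a_1}, \dots, \sigma_{a_m}) = a_1 a_2 \cdots a_m$;
$(J_i\theta)(\sigma_{a_1}, \dots, \sigma_{a_m}) = \int_0^{a_i} \theta\, dx_i$  $(a_i > 0;\ i = 1, \dots, m)$,
$(J\theta)(\sigma_{a_1}, \dots, \sigma_{a_m}) = \int_0^{a_1} \cdots \int_0^{a_m} \theta\, dx_1 \cdots dx_m$.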

With regard to each of the classes $\mathfrak{S}_1, \mathfrak{S}_2, \dots, \mathfrak{S}_m$ we make definitions
analogous to those made for the class $\mathfrak{S}$ in Paper I, and we further postulate
analogous properties. These properties will be referred to by the same letters
as in the previous paper with a subscript or index attached to indicate the
particular class to which reference is made. When any two functions $\theta$, $\alpha$ on
$\mathfrak{S}_1, \dots, \mathfrak{S}_m$ to $\mathfrak{A}$ are regarded as functions of a single set $\sigma^{(i)}$, the other sets
being held fixed, we define the notation $(D_i\theta)(\sigma^{(1)}, \dots, \sigma^{(m)}) = \alpha(\sigma^{(1)}, \dots, \sigma^{(m)})$
in a manner entirely analogous to that in which the notation $(D\theta)(\sigma) = \alpha(\sigma)$
was defined in Paper I.

When functions of the class $\mathfrak{G}$ are regarded as functions of a single set $\sigma^{(i)}$,
the other sets being held fixed, we postulate for them the properties of class $\mathfrak{G}$
of Paper I, these properties to be referred to by the same letter with suitable
index or subscript. Analogous properties for the classes $\mathfrak{H}$ and $\mathfrak{F}$, designated
in similar fashion, are also postulated. Furthermore for the classes $\mathfrak{H}_i$ and $\mathfrak{F}_i$,
regarded as functions of the set $\sigma^{(i)}$ alone, we postulate the properties of
classes $\mathfrak{H}$ and $\mathfrak{F}$ of Paper I and indicate them in like manner. When the
functions of classes $\mathfrak{G}$ and $\mathfrak{H}_i$ that are involved in the operation $J_i$ are regarded
as functions of the set $\sigma^{(i)}$ alone, we require $J_i$ to have all the properties
required of $J$ in Paper I, which properties we shall designate by the same
symbols with subscript or index $i$. We further postulate that any of the
operations $J_1, J_2, \dots, J_m$ is interchangeable with any other of the set, which
property we designate as (J). We also postulate as to the relationship between
$J$ and $J_1, J_2, \dots, J_m$ that


$(J\theta)(\sigma^{(1)}, \dots, \sigma^{(m)}) = (J_1 J_2 \cdots J_m\,\theta)(\sigma^{(1)}, \dots, \sigma^{(m)})$.
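For the special instance of double series (m = 2), these two postulates can be illustrated concretely. The sketch below is an assumption-laden illustration rather than part of the paper: it takes each J_i to be partial summation over the i-th index of a finite array of terms, and J to be summation over the full rectangle. The function names J1, J2 and J are ours.

```python
from itertools import accumulate

def J1(theta):
    """Partial sums over the first index: (J1 theta)(n1, n2) = sum over k1 <= n1 of theta(k1, n2)."""
    columns = [list(accumulate(col)) for col in zip(*theta)]
    return [list(row) for row in zip(*columns)]

def J2(theta):
    """Partial sums over the second index: (J2 theta)(n1, n2) = sum over k2 <= n2 of theta(n1, k2)."""
    return [list(accumulate(row)) for row in theta]

def J(theta):
    """Rectangular partial sums over both indices at once."""
    rows, cols = len(theta), len(theta[0])
    return [[sum(theta[i][j] for i in range(a + 1) for j in range(b + 1))
             for b in range(cols)]
            for a in range(rows)]

# A small array of terms of a double series (arbitrary sample values).
theta = [[1, -2, 3],
         [4, 0, -1],
         [2, 5, -3]]

# The operations are interchangeable, and J is their composition.
assert J1(J2(theta)) == J2(J1(theta)) == J(theta)
```

In this instance interchangeability is simply the statement that a finite double sum may be taken in either order.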


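With regard to the special function $\varphi_0^{(m)}(\sigma^{(1)}, \dots, \sigma^{(m)})$ we postulate that


(X)    $\varphi_0^{(m)}(\sigma^{(1)}, \dots, \sigma^{(m)}) = \varphi_0(\sigma^{(1)})\,\varphi_0(\sigma^{(2)}) \cdots \varphi_0(\sigma^{(m)})$,


where $\varphi_0(\sigma^{(i)})$ as function of $\sigma^{(i)}$, for $i = 1, 2, \dots, m$, is the same function
as $\varphi_0(\sigma)$ of Paper I as function of $\sigma$. We also postulate for the class § that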
(Jip) (o™,..., for /=1,2...., m, is of the class which property 
we designate as (KX). 

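For the sake of brevity we shall agree to represent, in all cases where no
loss of clearness is involved, the set of elements $p^{(1)}, \dots, p^{(m)}$ by the single
symbol $p$, the set of classes $\mathfrak{P}^{(1)}, \dots, \mathfrak{P}^{(m)}$ by $\mathfrak{P}$, the set of classes $\mathfrak{S}^{(1)}, \dots, \mathfrak{S}^{(m)}$
by $\mathfrak{S}$, and the set of sets $\sigma^{(1)}, \dots, \sigma^{(m)}$ by $\sigma$. Analogous to the definition of
the notation $\lim_\sigma \theta(\sigma) = a$ in Paper I, we define the corresponding notation
in the case that $\sigma$ represents the set of sets $\sigma^{(1)}, \dots, \sigma^{(m)}$ to mean that corresponding
to every positive $\epsilon$ there exist sets $\sigma_\epsilon^{(1)}, \dots, \sigma_\epsilon^{(m)}$ such that for sets
$\sigma^{(i)} > \sigma_\epsilon^{(i)}$ $(i = 1, 2, \dots, m)$ we have $|\theta(\sigma) - a| < \epsilon$.

We then postulate as to the class $\mathfrak{G}$ the property (B) defined by
(B) If $\lim_\sigma \gamma(\sigma)$ exists and is equal to $a$, then $|\gamma(\sigma)| < a_\gamma$  $(\sigma)$.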


If we also for the sake of brevity agree to represent a group of properties
such as $R_1, R_2, \dots, R_m$ by the single letter $R$, we may indicate the foundation
of our theory as follows:


> UAR onStoA%. LP, L,P; 


C. 4, StoA.LPS,B 
onStoU.LPS,C IK. (m)%. X 


7,02 to on to My? My’ I) 15’ I 
vt 
pou Sto H.on HtoF 


We now set 


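(1)    $\varphi_{0n}^{(m)}(\sigma) = \varphi_{0n}(\sigma^{(1)})\,\varphi_{0n}(\sigma^{(2)}) \cdots \varphi_{0n}(\sigma^{(m)})$,


where $\varphi_{0n}(\sigma^{(i)})$, as function of $\sigma^{(i)}$, is the same function as $\varphi_{0n}(\sigma)$, as function
of $\sigma$, defined by equation (3) of Paper I. We are then ready to define the
two generalized limits with which we shall be concerned. Given any function
$\gamma$ we set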


(n!)™ 
(2) (Cua) (9) = 
M Je\(a 
(3) (M 9) (. y)(s), 
(4) (Ay) = (2). 


If for a fixed $n$, $\lim_\sigma (C_n\gamma)(\sigma)$ exists, we define this limit as the generalized
limit of type (C n) for $\gamma(\sigma)$. If $\lim_\sigma (H_n\gamma)(\sigma)$ exists, we define this limit
as the generalized limit of type (H n) for $\gamma(\sigma)$.




Before proceeding to the proof of the equivalence theorem we introduce the 
following notations: 


(5) (Mix) = (0) ] (4 = 1,..., m), 


Pon (2) Don (n), 


(6) (9) = 


= Polo) Po Po (s>2; «= 1,..., m), 
(7) 

= Po(o) x(a). y(s) = (¢ = 1,...,), 
where 3(, of, ... are defined with regard to o” in the same manner as 
01, S2,... With regard to o in Paper I: 

(8) (S®y)(s) = i++ (n), 
(9) (6) = (SP (S@--- (SP (0) (ny i=1,..., m), 
(10) (Spx) (o) = (SP (SRP 7) (a) (n), 
(7 
(11) = ny(e) - (J; 7) (9) ¢== 1, ...,m). 


Pon 


We are now ready for the proof of our theorem; we begin by proving some
lemmas.
LEMMA 1. If we define $S_n$ as in (10), we have the identity


(12)    $(S_n(C_n\gamma))(\sigma) = (M(C_{n-1}\gamma))(\sigma)$    $(n)$.


We have from Lemma 1 of Paper I and the interchangeability of the various 
operations involved 




(Sy (Cn )) (a) -,m—1) ..m—1) n)) )) (c) 


( +, m—2) ia” (My, (c™, (c) 


(so m--2) (Mn (ce) ™ n))) 


— (M, eee (M,, eee )) (cs) = (M(Cy-1 (co). 


Our lemma is therefore established. 
We define yp‘? in a manner analogous to the definition of y in (7). We also set 


Pon-i (5) = Pon 


We then prove 

LEMMA 2. Jf lim, exists and is equal to a and \p(s)|< a, for every 
then lim, [9,,, (J, will exist and be equal to a/n and we shall 
have 


[Pon (J; P®) (8), < i= 1,..., m). 


Given a positive e, we choose 32 so that a—(e/4)< p(s)<a+(e/4) for 
We have 


[Pon (J; PP) (9) 
(13) = [9,, (J, ..., ..., 


[Pon (o@)]-! [(J; g®) (o)—(J, gp) : 


* It should be remembered throughout that o! is an abbreviation for (0°, o,.... 


and that is an abbreviation for the set of relationships 

aM 




Analogous to (18) of Paper I we have the relationship 


(14) (J, P?) = (4, (D,; |) (2). 
on the right hand side of (13) lies between 


4 


Making use of (14), and postulates 1% and J, we see that the second term 


1 Pon (9) 1 Pon 


From (IV) of Paper I it follows that for a proper choice of a,’ > 0 the above 
expression differs from a/n by a quantity that is less in absolute value than 


Le for alla 


The first term on the right side of (13) is seen from (14), 14”, and I? to 


be less in absolute value than 


Pon “e ) 


n (os) 


From IV of Paper | it follows that we can choose o” > 0, so as to make this 
expression less in absolute value than }¢ for c> 0”. If now we choose for o, 


the greater of and 92”, it follows from (13) that for 


[9 (J, (8) — (a/n)! <e. 


and the first part of our conclusion is established. 
Making use of (14) and M;”, we have 


(J, 9) (8) < 


ay 
n 


which establishes the second part of our conclusion. 




LEMMA 3. Jf lim, exists and is equal toa, and < a, for every s, 
then will exist and be equal to a and weshall have (Sr < as 
Sor every 

By virtue of definition (10) the operation S, is equivalent to a succession 
of operations S\’ for i —1,2,..., m. It follows from Lemma 2, for the 
case n = 1, that if a function gy (¢) remains finite for all ¢ and approaches 
a limit as to o, the same is true for the function resulting from the operation 
S\? applied to g(a). Hence by a succession of m applications of Lemma 2, 
we obtain the conclusion of the present lemma. 

We now set 


(15) yi = p)(c) 1,2,...,m). 
We then prove 


LEMMA 4. If for any i lim, 9j(s) exists and is equal to a, and gj(s), <a 
Jor every 3, then lim, p(c) will exist and be equal to a, and we shall have 
< ag for every 5. 
By a procedure analogous to that used in the proof of Lemma 3 of Paper | 
we may transform equation (15) into the form 


(16) = (Te g)(s) (n>2;i=1,...,m), 


where 7 is defined by equation (11). Our lemma then follows from Lemma 2 
forn>2. For n = 1 it is an obvious consequence of (15) and (8). 


LEMMA 5. Jf lim, (S» ~) (¢) exists and is equal to a and a 
Jor every 5, then lim, y(o) will exist and be equal to a and we shall have 
p(s) < dz for every 

Making use of the definition of S, given in equation (10), we see that this 
lemma may be established by successive applications of Lemma 4. 

Noting that S, and M are interchangeable operations, we have from suc- 
cessive applications of (12), in a manner analogous to the corresponding 
reductions in Paper I by means of equation (14) of that paper, 


(17) (Hn 9) (Sn-1 (Sp (Cy (n), 




We are now ready to prove our theorem:

THEOREM. If $\lim_\sigma (C_n\gamma)(\sigma)$ exists and is equal to $a$, then $\lim_\sigma (H_n\gamma)(\sigma)$
will exist and be equal to $a$, and conversely.

From (17), (B), and successive applications of Lemma 3, we obtain the
result:

If there exists $\lim_\sigma (C_n\gamma)(\sigma) = a$, then there exists $\lim_\sigma (H_n\gamma)(\sigma) = a$  $(n)$.

From (17), (B), and successive applications of Lemma 5, we obtain the
result:

If there exists $\lim_\sigma (H_n\gamma)(\sigma) = a$, then there exists $\lim_\sigma (C_n\gamma)(\sigma) = a$  $(n)$.

Our theorem is therefore established.


UNIVERSITY OF CINCINNATI. 
CINCINNATI, OHIO. 


THE EQUILONG TRANSFORMATIONS OF EUCLIDEAN SPACE*


BY


B. H. BROWN


In the classical non-euclidean geometries of space of n dimensions, distance
as well as angle has a projective definition, and equilong transformations are
the dual of conformal transformations by polar reciprocation in the absolute.
In euclidean space the projective definition is lost, but while the preceding
duality breaks down, Scheffers† exhibited a perfect analogy in the euclidean
plane by the use of the dual numbers of Study. We know that for any function
of the complex variable


$F(x + iy) = X(x, y) + iY(x, y)$,


where X and Y satisfy the Cauchy-Riemann differential equations


(1)    $\dfrac{\partial X}{\partial x} = \dfrac{\partial Y}{\partial y}$,    $\dfrac{\partial X}{\partial y} = -\dfrac{\partial Y}{\partial x}$,


the point transformation


(2)    $X = X(x, y)$,    $Y = Y(x, y)$


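is directly conformal. Scheffers proved that if u and v denote the Hessian
normal coördinates of an oriented line (v the distance parameter), for any
function of the dual number

$F(u + \epsilon v) = U(u, v) + \epsilon V(u, v)$,

where U and V satisfy the differential equations

(3)    $\dfrac{\partial U}{\partial u} = \dfrac{\partial V}{\partial v}$,    $\dfrac{\partial U}{\partial v} = 0$,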


* Presented to the Society, December 27, 1922.
† Mathematische Annalen, vol. 60 (1905), p. 491.
1. 


the oriented line transformation 


U = U(u,v), V = V(u, v) 


is directly equilong. 

Since the conformal group in non-euclidean as well as in euclidean three- 
space is a ten-parameter group, the equilong group in non-euclidean three- 
space depends on ten parameters. But the equilong group in euclidean space 
contains arbitrary functions.* In space of more than three dimensions, the 
conformal euclidean group, and the conformal and equilong non-euclidean 
groups contain a finite number of parameters, but Coolidge† has shown that:
The most general equilong transformation of a euclidean space of n dimensions 
depends on the most general conformal transformation of a space of n—1 
dimensions and an arbitrary function of the direction parameters. The distance 
parameter enters linearly. 

The above theorem is true for $n \geq 3$, but the last statement is also true
for n = 2, since the integration of (3) gives


(4)    $U = U(u)$,    $V = U' \cdot v + \varphi(u)$.
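The linear entry of the distance parameter can be seen directly from dual-number arithmetic. The following minimal sketch is our own illustration, not Scheffers' construction: it implements numbers u + εv with ε² = 0 and evaluates a hypothetical sample function F(z) = z³ + 2z. The result has U = F(u) and V = F'(u)·v, which is the case of (4) in which the additive function of u is zero.

```python
from dataclasses import dataclass

def _lift(x):
    """Treat an ordinary number as a dual number with zero epsilon part."""
    return x if isinstance(x, Dual) else Dual(float(x), 0.0)

@dataclass(frozen=True)
class Dual:
    """Dual number u + eps*v, where eps*eps = 0."""
    u: float
    v: float

    def __add__(self, other):
        other = _lift(other)
        return Dual(self.u + other.u, self.v + other.v)

    __radd__ = __add__

    def __mul__(self, other):
        other = _lift(other)
        # (u1 + eps v1)(u2 + eps v2) = u1 u2 + eps (u1 v2 + v1 u2)
        return Dual(self.u * other.u, self.u * other.v + self.v * other.u)

    __rmul__ = __mul__

def F(z):
    """Hypothetical sample function F(z) = z**3 + 2*z, written with + and * only."""
    return z * z * z + 2 * z

u, v = 1.5, 0.7
image = F(Dual(u, v))
U_expected = u ** 3 + 2 * u            # U depends on u alone
V_expected = (3 * u ** 2 + 2) * v      # V = U'(u) * v, linear in v
print(image, U_expected, V_expected)
```

Because ε² = 0, every power of v above the first drops out, so V is necessarily linear in v for any polynomial F.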


This fact leads to a hitherto unnoticed analogy between the conformal and 
equilong transformations in the plane, and to a sharpening of the contrast in 
higher spaces. The functions X and Y of (1) satisfy Laplace’s equation. 
Again in Study’s formulation of the conformal (and therefore equilong) trans- 
formations in the Riemannian and Lobatschewskian planes, the functions of 
hypercomplex variables are separable into functions satisfying either Laplace’s 
equation or the hyperbolic form 


$\dfrac{\partial^2 f}{\partial x^2} - \dfrac{\partial^2 f}{\partial y^2} = 0$,


* This remarkable theorem was first enunciated, without proof, by Study, Sitzungsberichte
der Niederrheinischen Gesellschaft für Natur- und Heilkunde, Dec. 5,
1904. In 1908 Coolidge gave the first published proof of this theorem, and a correct
explicit form for these transformations in these Transactions, vol. 9 (1908), p. 178. An
incorrect derivation leading to a ten-parameter group was given by Loehrl in his Würzburg
dissertation (1910). A demonstration, independent of Coolidge's, was given by Blaschke,
Archiv der Mathematik und Physik, vol. 16 (1910), p. 182. The final form of these
transformations is, however, incorrect with respect to a distinction of signs. This error has
never, to our knowledge, been corrected. In 1916 Coolidge in his Treatise on the Circle
and the Sphere, p. 419, changing the correct form of his 1908 paper, reproduced Blaschke's
incorrect form.


† Loc. cit., p. 182.




equations which are not essentially distinct for complex solutions. Finally
the functions U and V of (4) satisfy the parabolic equation


(5)    $\dfrac{\partial^2 f}{\partial v^2} = 0$.

But while the functions in the equations of an equilong transformation in 
n-dimensional euclidean space are non-trivial solutions of (5) (v denoting the 
distance parameter), the analogy is completely lost in the other cases. 

In this paper we give in Section 2 a new demonstration of the fundamental 
equations for equilong transformations in euclidean three-space. The main 
portion of this paper is then devoted to a discussion of groups of these trans- 
formations which leave invariant various differential expressions and equations. 


The equation 


ute i(u — v) w 


(6) 


represents an oriented plane such that the direction cosines of its oriented
normal are the coefficients of x, y, and z, and such that the distance from the
origin to the plane is $w/(1 + uv)$. Then u, v, w are Bonnet* tangential coördinates
of this oriented plane. Exceptional cases occur when: (a) $1 + uv = 0$ (minimal
plane); (b) the spherical representation of the plane is a point on a ruling
through the south pole. The point of contact of a plane and any envelope
which touches the plane is given by the equations


= w, 


(4) = = 
Ow 

q 


The square of the distance between two points of a plane is


(8)    $(p_1 - p_2)(q_1 - q_2)$.


*Liouville’s Journal, ser. 2, vol. 5 (1860), pp. 153-266. 


In any plane transformation
        U = U(u, v, w),
(9)     V = V(u, v, w),
        W = W(u, v, w),     J ≠ 0,


corresponding points of two corresponding planes are projectively related. 
To find the equations of the equilong transformations we simplify the form 
of (9) by imposing the necessary conditions that the collineation be 


(a) affine; 
(b) directly or indirectly conformal. 


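Blaschke has shown that under these impositions the once-extended
transformations are

Direct:    $U = U(u)$,  $V = V(v)$,  $W = W(u, v, w)$,
           $P = \dfrac{1}{U'}\left[\dfrac{\partial W}{\partial u} + p\,\dfrac{\partial W}{\partial w}\right]$,  $Q = \dfrac{1}{V'}\left[\dfrac{\partial W}{\partial v} + q\,\dfrac{\partial W}{\partial w}\right]$;

Indirect:  $U = U(v)$,  $V = V(u)$,  $W = W(u, v, w)$,
           $P = \dfrac{1}{U'}\left[\dfrac{\partial W}{\partial v} + q\,\dfrac{\partial W}{\partial w}\right]$,  $Q = \dfrac{1}{V'}\left[\dfrac{\partial W}{\partial u} + p\,\dfrac{\partial W}{\partial w}\right]$.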

It is now necessary and sufficient to impose the condition that


$(p_1 - p_2)(q_1 - q_2)$


be an absolute invariant.


(P, — P,) (Q, — Q) 


Hence 


(P, — P2) — %) = 


~ Py) 


Tn either case we have 


Po) 


Ow 




We thus have as our fundamental equations


        Direct:  $U = U(u)$,                      Indirect:  $U = U(v)$,
(10)             $V = V(v)$,                                 $V = V(u)$,
                 $W = \sqrt{U'V'}\,w + F(u, v)$;             $W = \sqrt{U'V'}\,w + F(u, v)$.


Blaschke, and subsequent writers, incorrectly insert a ± sign under the
radicals of (10). For such transformations the plane projectivity is either 
directly conformal and indirectly equiareal, or indirectly conformal and directly 
equiareal. In neither of these cases is square of distance preserved. 


We state, without proof, the fundamental formulas in the differential geometry
of a non-developable oriented surface w = w(u, v). The coördinates of
a point of tangency are given by


2 
pu 


l+ur 
vw—qu+p 
(11) 
: 1+ ur 
Let — =r, ——_ =, Alt = ft; then the three fundamental forms are 
Ov 
ds* = rizt+s)dv?4+ 
r 2(z+s) t 
(12) du dudi di 
4dudv 
do? = — >: 
The total curvature is 
—4 
(13) (1+ uv) {rt—(z+s)*}’ 




the mean curvature 
(1+ 


(14) 
the lines of curvature are given by
(15)    $r\,du^2 - t\,dv^2 = 0$;
the radii of principal curvature by 


Vrt) 


(16) R = 5} 
and the centers of curvature by 

X+iY = q—u(s+V*rt). 
(17) X—iY¥ = p—v(s+V rt), 


w — up—vg—(1— uv) (s+V rt) 


Z= 


The differential equations of minimal curves and of asymptotic curves follow
from (12). The differential equation of minimal surfaces is


(18)    $z + s = 0$,  $rt \neq 0$;

of spheres (oriented, non-null spheres)

(19)    $r = t = 0$,  $z + s \neq 0$;

of points, which are not to be excluded on the score of the discriminant of the
first quadratic form vanishing, but which are proper envelopes of $\infty^2$ planes,
and may be regarded either as minimal surfaces or as spheres,


(20)    $r = t = z + s = 0$.


Among the indirect equilong transformations, the analogue of the identity 
is that one which merely reverses the orientation of a plane, without changing 


4. 




its position. This transformation we term the “pseudo-identity”. It is clear 
that every indirect transformation is the product of a direct transformation 
and the pseudo-identity. The twice-extended form of the pseudo-identity is 


vy’ 
en 
u’ 
uv 
_ w—vg 
(21) u 
Q 
R _ vt 
a ’ 
S = up+vqa—w—uvs, 
wr 


It will be observed that the equations and expressions (11) to (20) are 
invariant (invariant except for sign) under (21) as they have geometric signi- 
ficance independent of (dependent on) orientation. The differential geometry 
of oriented surfaces is the interpretation of the differential invariants of the 
extended pseudo-identity. A surface whose equation is invariant under the 
pseudo-identity is obviously one-sided. We have then the 

THEOREM. A necessary and sufficient condition that a surface $w = f(u, v)$ be
one-sided is that $f$ satisfy the functional equation $f(u, v) = -uv\,f(-1/v, -1/u)$.
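The functional equation is easy to test on simple examples. The sketch below is our own illustration and assumes the equation in the form f(u, v) = -uv f(-1/v, -1/u). It uses two sample functions suggested by (26) and (28): w = 1 - uv, the tangential equation of a point, and w = 1 + uv, an oriented sphere about the origin. Exact rational arithmetic avoids rounding questions.

```python
from fractions import Fraction as Fr

def satisfies_one_sided_equation(f, u, v):
    """Check f(u, v) == -u*v*f(-1/v, -1/u) at one sample argument."""
    return f(u, v) == -u * v * f(-1 / v, -1 / u)

def point(u, v):
    """Sample function w = 1 - u*v (a point, unchanged by reversing orientation)."""
    return 1 - u * v

def sphere(u, v):
    """Sample function w = 1 + u*v (an oriented sphere about the origin)."""
    return 1 + u * v

samples = [(Fr(2, 5), Fr(-7, 3)), (Fr(3), Fr(1, 4)), (Fr(-1, 2), Fr(5, 6))]
point_ok = all(satisfies_one_sided_equation(point, u, v) for u, v in samples)
sphere_ok = any(satisfies_one_sided_equation(sphere, u, v) for u, v in samples)
print(point_ok, sphere_ok)
```

The point passes at every sample and the sphere at none, in agreement with the remark that reversing orientation changes an oriented sphere but not a point.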


We shall next prove the 

THEOREM. Any oriented non-developable surface may, by each of two and 
only two distinct, direct equilong transformations, be transformed into any 
other oriented non-developable surface, and that with an arbitrary analytic 
directly conformal mapping of their spherical representations. 

This theorem was suggested by Study in his 1904 paper, but he stated, 
incorrectly, that there was one and only one such transformation. 




Let us consider two surfaces


$W = f_1(U, V)$,


$w = f_2(u, v)$,


and let us assume that the spherical representation of the first (U, V) is conformally
mapped on the spherical representation of the second (u, v) by the
directly conformal transformation


$U = U(u)$,  $V = V(v)$.


The theorem is proved if, in the group of transformations

$U = U(u)$,
$V = V(v)$,
$W = \sqrt{U'V'}\,w + F(u, v)$,


we can determine two and only two functions F(u, v) such that the first
surface is transformed into the second. This means that


$\sqrt{U'V'}\,w + F(u, v) = f_1(U(u), V(v))$


must be identical with


$w = f_2(u, v)$,

which is true when and only when

$F(u, v) = f_1(U(u), V(v)) - \sqrt{U'V'}\,f_2(u, v)$;


hence there are always two distinct transformations, one for each determination
of the radical. We should note that there is no exception when $f_2 = 0$.




6. 


The twice-extended form of the general direct equilong transformation may
be written


$U = U(u)$,


$V = V(v)$,


$W = \sqrt{U'V'}\,w + F(u, v)$,


. 
wV'2U" pv’? 
U'? 
(22) 
3 
y” 
> 
i 1 1 
9’? 19 
wU" y" ) ( 8 
4(U'V')? 2U'2 (U'v’)? 
1 1 1 
wU'?V" = tu’? 
oy’? 4 


It is proposed to discuss the invariance of the equations and expressions (11)
to (20) under (22).
First, under a direct transformation, a necessary and sufficient condition
that spheres transform into spheres is that R vanish with r and T with t.
This requires
— = 0. 


(23) = 0. 
Fuv = Fyv = 0. 


The first two of (23) recall the Schwarzian derivative. Integrating, we have 


y — “uts 
(24) 


F = AUV+BU+CV+D. 




We may and shall choose ratios so that
$\alpha\delta - \beta\gamma = \alpha'\delta' - \beta'\gamma' = 1$.


We have then 


(25) 
(yut+d)(yv+0) 


(24) and (25) giving the equations of the well known Laguerre group. We might 
just as easily have found these by imposing the condition that lines (more 
properly strips) of curvature go into lines of curvature. 

It is easy to verify that translations are given by


(26)    $U = u$,  $V = v$,  $W = w + a(uv - 1) + bu + cv$;


reflection in the origin by

(27)    $U = u$,  $V = v$,  $W = -w$;

dilatations by

(28)    $U = u$,  $V = v$,  $W = w + a(uv + 1)$;

rotations by


(29)    $U = \dfrac{au + b}{cu + d}$,  $V = \dfrac{dv - c}{-bv + a}$,  $W = \dfrac{w}{(cu + d)(-bv + a)}$,


where $ad - bc = 1$. Other transformations involve Laguerre inversions.
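That the rotations have the form (10) can be checked numerically. The following sketch is our own, under the assumption that (29) reads as the fractional linear maps given above: it differentiates U and V by central differences and compares U'V' with the square of the factor multiplying w. The coefficient values are arbitrary sample choices.

```python
# Sample complex coefficients; d is chosen so that a*d - b*c = 1.
a, b, c = 2 + 1j, 1 - 1j, 0.5j
d = (1 + b * c) / a

def U(u):
    """First equation of the rotation: U = (a u + b) / (c u + d)."""
    return (a * u + b) / (c * u + d)

def V(v):
    """Second equation of the rotation: V = (d v - c) / (-b v + a)."""
    return (d * v - c) / (-b * v + a)

def derivative(func, z, h=1e-6):
    """Central-difference approximation to func'(z)."""
    return (func(z + h) - func(z - h)) / (2 * h)

u, v = 0.3 + 0.2j, 0.3 - 0.2j
product_of_derivatives = derivative(U, u) * derivative(V, v)
squared_factor = 1 / ((c * u + d) * (-b * v + a)) ** 2
print(abs(product_of_derivatives - squared_factor))  # a very small number
```

The coefficient of w in W is therefore one determination of the square root of U'V', as the fundamental equations require, and F vanishes identically for a rotation.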
On account of the simplicity and frequency of occurrence of the expression
z + s, we next consider the transformations under which minimal surfaces
transform into minimal surfaces. It is clear that any transformation


$U = u$,  $V = v$,  $W = w + f_1(u, v)$,


where $f_1$ is a solution of z + s = 0, will carry any minimal surface into
a minimal surface, for the sum of two solutions of a linear homogeneous partial
differential equation is itself a solution. To this group of transformations we
may, from geometric considerations, adjoin (27) and (29). It turns out that
these are the only such transformations; we term this group the “minimal
group”.




To prove this statement, if we impose Z + S = 0 on z + s = 0 we must
have


[zc 


nol 


4 8 [ V’ UV 


3 
+(U' V’)? [F—UFy—VFvy+(14+ UV) Fov] 


= {w—up—vqt+(1+uv)s}. 


Hence it is necessary that 

UV’ 

(32) 

1+ UV Zt 1+ uv 

4U'V' 1+ UV 2 | ryva+uv) 
(34) = 0. 


Subtracting (33) from the product of (31) and (32) we have 


(35) 




hence we may rewrite (31) and (32) as 


rrv2 
v2 
(36) 
r 
y’2 


Differentiating the first of (36) with regard to v, and the second with regard 
to 


UA+ UV) vert 3 
(37) 
UV) | veer 3 = 
hence 


are necessary conditions on $U'$ and $V'$. Now if, in (33), we substitute the value
of $(1 + uv)$ given in (35), and these last values of $U'$ and $V'$, it is necessary that


$\alpha : \beta : \gamma : \delta = \delta' : -\gamma' : -\beta' : \alpha'$,


and the theorem is proved. 

If we examine the form of any one of (12), (13), or (14) we see without 
difficulty that any one of them is invariant if and only if the transformation 
belongs to both the Laguerre and minimal groups. The actual verification of 
this is so much a repetition of the previous proof that it is omitted. Un- 
fortunately the only such transformations are the congruent transformations; 
for the only non-parallel transformations of the minimal group are rotations, 
and of the parallel transformations of the Laguerre group, dilatations carry 
a point into a sphere which is not a minimal surface. Our results are then 
essentially negative if we impose the condition on all surfaces, but there are 
interesting special cases for groups of transformations and groups of surfaces. 




We consider only one such example: the conformal mapping of surfaces 
under equilong transformations. The differential equation of minimal curves 
on a surface is, by (12), 


The following theorems follow immediately : 

1. Under a transformation of the Laguerre group any sphere and the trans- 
formed sphere are conformally mapped. 

2. Under a transformation of the minimal group, any minimal surface and 
its transformed minimal surface are conformally mapped. 

3. The only surfaces transformed into their spherical representations with 
conformal mapping by equilong transformations are spheres and minimal 
surfaces. 

4. A minimal surface may be transformed into any sphere with conformal 
mapping, and conversely. 


7.


Since the solutions of linear homogeneous partial differential equations
possess the additive property, we may associate with every such equation the
surfaces that are solutions thereof, and a corresponding group of direct
parallel equilong transformations


$U = u$,  $V = v$,  $W = w + f(u, v)$,


where f is itself a solution of the given differential equation; under this group
of transformations the surfaces are permuted among themselves. Obviously
we shall be most interested in differential equations invariant under the
pseudo-identity. For such differential equations there is a sub-group of transformations
which will permute the one-sided surfaces of the group among
themselves, for the solutions of the linear homogeneous functional equation


$f(u, v) = -uv\,f(-1/v, -1/u)$


possess the additive property. Thus, for example, the double minimal surfaces
are permuted among themselves by the appropriate subgroup, for they are the
only one-sided minimal surfaces.

Let us consider a solution of (18), a minimal surface whose equation may
be written


$w = 2v f(u) + 2u f_1(v) - (1 + uv)\{f'(u) + f_1'(v)\}$.
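A numerical check of this representation may be helpful. The sketch below is our own and rests on two assumptions: that the equation z + s = 0 of (18) may be written as w - up - vq + (1 + uv)s = 0, which is the left-hand side of (40) with k = 0, and that p, q, s denote the derivatives of w with respect to u, to v, and to both. The choices f(u) = u³ and f₁(v) = v² are arbitrary samples.

```python
def w(u, v):
    """w = 2 v f(u) + 2 u f1(v) - (1 + u v)(f'(u) + f1'(v)) for sample f and f1."""
    f, df = u ** 3, 3 * u ** 2       # f(u) = u**3 and its derivative
    f1, df1 = v ** 2, 2 * v          # f1(v) = v**2 and its derivative
    return 2 * v * f + 2 * u * f1 - (1 + u * v) * (df + df1)

def residual(u, v, h=1e-3):
    """Evaluate w - u p - v q + (1 + u v) s with central-difference derivatives."""
    p = (w(u + h, v) - w(u - h, v)) / (2 * h)
    q = (w(u, v + h) - w(u, v - h)) / (2 * h)
    s = (w(u + h, v + h) - w(u + h, v - h)
         - w(u - h, v + h) + w(u - h, v - h)) / (4 * h * h)
    return w(u, v) - u * p - v * q + (1 + u * v) * s

for u, v in [(0.4, -0.7), (1.2, 0.3), (-0.5, 0.9)]:
    print(u, v, residual(u, v))  # each residual is zero up to discretization error
```

The residual vanishes for any differentiable f and f₁, since the terms cancel identically; the finite differences only introduce an error of the order of h².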




We may associate with this a two-parameter family of minimal surfaces [A, B]

$w = A\{2v f(u) - (1 + uv) f'(u)\} + B\{2u f_1(v) - (1 + uv) f_1'(v)\}$,


where [A, A] are expansions of the original surface, and [A, 1/A] its continuous
deforms;* and we may also associate therewith the two-parameter family of
parallel equilong transformations of the minimal group which permute these
among themselves.

The $\infty^2$ points, one from each of these surfaces, with properly parallel
tangent planes (planes with the same u and v) may be obtained by expanding
at the origin the conic which is the path-curve of the point of the original
minimal surface under the continuous transformation which gives the associated
surfaces. These points are coplanar, as the plane of the conic contains
the origin. The tangent planes are not, in general, coincident with this locus
plane. Under a parallel equilong transformation any aggregate of $\infty^2$ planar
elements with (properly) parallel planes are rigidly translated as a whole
(Study and Blaschke) so that an equilong transformation of the group effects
a translation of this plane. In this plane associated minimal surfaces are
represented by points of a conic which is one of a one-parameter family of
homothetic conics. A transformation of the group will translate this conic,
the transformed conic cutting the original conic and each of the $\infty^1$ homothetic
conics in two points (since the axes of the conics are parallel). Hence
under any transformation of the group only two associated surfaces of a given
minimal surface will transform into associated surfaces; the other associated
surfaces will, in general, be transformed by pairs into associated minimal surfaces
of the $\infty^1$ expansions of the given minimal surface.

This discussion may obviously be extended to any system of surfaces whose
equations are


(38)    $w = A \sum f_i(u, v) + B \sum g_i(u, v)$,
except that the surfaces [A, 1/A] are not generally continuous deforms.


As an example we have certain surfaces of Goursat,† for which the sum of
the radii of curvature at a point is proportional to the distance from the 


* Although not coextensive, we shall use the expression “continuous deforms” as
equivalent to “associated surfaces”. This is proper, since a continuous deform is an associated
surface, or can be made to coincide with one by a congruent transformation.

† American Journal of Mathematics, vol. 10 (1887-8), p. 187; Baroni, Giornale
di Matematiche, vol. 28 (1890), p. 349.




origin to the tangent plane at the point. From (6) and (16) the equation of
these surfaces is


(39)    $(1 + uv)s - up - vq + w\left[1 + \dfrac{2k}{1 + uv}\right] = 0$,


the ratio of proportionality being 2k. Goursat proved that when (and only
when) m defined by


$k = \dfrac{(m + 1)(m - 2)}{2}$


is integral, the solution of (39) may be obtained free from quadratures, and,
indeed, in the form (38). For these values of k the preceding discussion holds,
with deletion of the expression “continuous deforms”.

Two special cases are worthy of note: 

(a) If k = 0, (39) is the differential equation of minimal surfaces;

(b) If k = -1, (39) is the differential equation of Appell* surfaces for
which the projection of the origin on every normal is midway between the
centers of principal curvature.

In the papers of Appell and Goursat, we find three classical transformations:

(a) A transformation of Appell which carries a particular minimal surface
into an Appell surface;

(b) A transformation of Appell which carries a particular Bonnet† surface
into an Appell surface;

(c) A transformation of Goursat which carries a particular Goursat surface 
into another Goursat surface with change of k. These transformations are 
equilong, and, in fact, special cases of Study’s theorem where the mapping of 
the spherical representations is the identity, and the upper sign for the radical 
is used. 

The general methods of this section are applicable to a large class of sur- 
faces defined by some relation involving their radii of curvature. One further 
group of transformations, defined by a linear non-homogeneous partial 
differential equation, merits attention. From (16) it follows immediately that 


(40) (l+uv)s—up—vq+uwu = —2k 


* American Journal of Mathematics, vol. 10 (1887-8), p. 175.
† Paris Comptes Rendus, vol. 42 (1856), p. 119, Note sur les surfaces pour lesquelles
la somme des deux rayons de courbure principaux est égale au double de la normale.




is the differential equation of all surfaces for which the sum of the radii of
principal curvature is a constant 2k. Such a surface is, for example, the “inner”
surface of a sphere of radius k, center at the origin


(41) k(1+up). 


Thus knowing one particular solution of (40) we obtain all the other solutions 
by adding to the right-hand side of (41) the general solution of the differential 
equation for minimal surfaces. Hence the direct parallel equilong
transformations

$U = u, \qquad V = v, \qquad W = w + f(u, v),$


where f(u, v) satisfies (40), carry minimal surfaces into the surfaces we are
considering, and carry surfaces the sum of whose radii of principal curvature
is $2k_1$ into surfaces the sum of whose radii of principal curvature is $2(k + k_1)$.


DARTMOUTH COLLEGE, 
HANOVER, N. H. 


INVARIANT SETS OF EQUATIONS IN RIEMANN SPACE*


BY 


PHILIP FRANKLIN 


1. INTRODUCTION 

It is assumed in the theory of relativity that physical quantities are
represented by expressions derived from tensor components, and that the laws
of nature may be expressed as equations stating the equality of two tensors.
This assumption is made so as to satisfy the requirement that physical laws
must be expressible in a form independent of the particular coördinates used.
If we start with this latter assumption, we are tempted to require merely that
the equations expressing a law of nature be invariant, as a set, under
transformations of coördinates, and the question arises as to the relation of equations
of this type, referred to in the sequel as an invariant set of equations, to the
tensor equations usually assumed. The requirement of invariance implies that
there is a law of transformation for the equations in terms of the
transformation of coördinates, which will be given if the equations involve merely tensor
components and the coördinates. If these quantities enter into the equations
in a sufficiently simple manner (which, however, is as general as is required
in most of the equations of physics), we may completely answer the question
raised above by the theorem

An invariant set of equations whose members are formed from the components
of one or more tensors and point functions by addition, multiplication, and
differentiation with respect to the coördinates is equivalent to a set of tensor
equations.

Here, as throughout this paper, the tensors relate to a Riemann n-space.

We apply this theorem to the classification of invariant equations involving
the derivatives of the fundamental quadratic tensor ($g_{ij}$) and show that

An invariant set of equations involving the derivatives of the $g_{ij}$, the second
derivatives appearing linearly, and no derivative higher than the second occurring,
is equivalent to one of five standard tensor equations.

A theorem nearly equivalent to this for 4-space was given by G. D. Birkhoff.†
Besides holding for n-space, our discussion is free from the assumptions that
the equations are homogeneous, which rules out Einstein’s “cosmological
equation”, and that the coördinates have a certain reality character.


* Presented to the Society, April 28, 1923.
† Relativity and Modern Physics, Harvard University Press, 1923, pp. 211-220.




The above theorems constitute the chief results of this paper. We proceed
to the proofs.

2. EQUATIONS LINEAR IN A SINGLE TENSOR 

We shall begin with the special case of equations involving the components 
of a single tensor, and these linearly, and shall prove that such an invariant 
set is equivalent to a set of tensor equations, the left members of these equa- 
tions being linear combinations of the single tensor given, and tensors easily 
derived from it, with scalar coefficients. To fix the ideas, we shall give the 
details in full only for a tensor of the fourth order: the methods are, however, 
general. 

If we consider our equations at a single point, the coefficients of the tensor
components, which in general are point functions, become constants. If we
further introduce normal (orthogonal geodesic) coördinates at this point, we
have there

(1) $g_{ij} = \delta_{ij}, \qquad \partial g_{ij}/\partial x_k = 0,$

and the transformations of coördinates which change one such system into
another are those belonging to the orthogonal group. Since our equations
remain invariant under all coördinate transformations, they retain their form
(as a set) for any linear orthogonal transformation. This leads to the

LEMMA. An invariant set of equations linear in a single tensor is equivalent
to a set of equations each of which has the property that any subscript appears an
odd number of times in every one of its terms, or an even number, perhaps zero.

We prove this by noting that when we perform the transformation


(2) $x_1' = -x_1, \qquad x_i' = x_i \quad (i \neq 1),$


which corresponds to a reflection in the 1-axis, any term containing the
subscript 1 an odd number of times has its sign changed, while any one con- 
taining it an even number of times is unaffected. Hence if we apply the 
reflection (2) to any one of our equations, we obtain a new equation which is 
a consequence of our given set, from the invariant character, and on being 
added to and subtracted from the original equation gives rise to two equations 
of the type required by the lemma for the subscript 1. By repeating the 
process for the remaining subscripts, we reach the desired result. 
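
The splitting used in this proof may be sketched as follows. The symbols E, O and G are labels introduced here for illustration only and do not occur in the text: E = 0 is any one equation of the set, O the sum of its terms odd in the subscript 1, and G the sum of its terms even in the subscript 1.

```latex
% E = 0 : one equation of the invariant set (hypothetical label).
% O     : sum of the terms containing the subscript 1 an odd number of times.
% G     : sum of the terms containing the subscript 1 an even number of times.
\begin{align*}
E  &\equiv \phantom{-}O + G = 0, \\
E' &\equiv -O + G = 0 \qquad \text{(the same equation after the reflection (2))}, \\
\tfrac{1}{2}(E - E') &\equiv O = 0, \qquad \tfrac{1}{2}(E + E') \equiv G = 0 .
\end{align*}
```

The two equations O = 0 and G = 0 together imply E = 0, so that nothing is lost when E = 0 is replaced by this pair.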

We turn now to the set of equations linear in the components of $P_{abcd}$, the
tensor of the fourth order. Suppose first that some one of our equations
contains a term with four distinct subscripts, say 1234 (all the subscripts, if in
four-space, a particular group if in n-space (n > 4); if in a space of less than
four dimensions, there are no such terms, and our argument proceeds at once




to the place below where terms with subscripts not all distinct are discussed). 
By applying the lemma, we may then obtain an equation in which every term 
contains these four distinct subscripts an odd number of times, and therefore 
each once. Let this equation be 


(3) $A\,P_{1234} + B\,P_{1342} + \cdots = 0.$

If we define the new tensor

(4) $Q_{abcd} = A\,P_{abcd} + B\,P_{acdb} + \cdots,$


we see that (3) gives


(5) $Q_{1234} = 0.$


Since our set of equations is invariant, (3) and hence (5) holds in all systems
of coördinates. In particular, since we may transform the coördinates by
a permutation of the variables, we see that all the components of $Q_{abcd}$ with
four distinct subscripts vanish.

We shall obtain the tensor equation which follows from our set by
determining constants A, B, etc. for which the equation


(6) $Q_{abcd} = \delta_{ab}\,(A\,Q_{iicd} + B\,Q_{iidc} + C\,Q_{icdi} + D\,Q_{idci} + E\,Q_{icid} + F\,Q_{idic}$
$\qquad\qquad + G\,Q_{cdii} + H\,Q_{dcii} + I\,Q_{ciid} + J\,Q_{diic} + K\,Q_{cidi} + L\,Q_{dici})$
$\qquad + \delta_{ac}\,(A'\,Q_{iibd} + B'\,Q_{iidb} + \cdots) + \cdots$
$\qquad + \delta_{ab}\,\delta_{cd}\,(U\,Q_{iijj} + V\,Q_{ijij} + W\,Q_{ijji})$
$\qquad + \delta_{ac}\,\delta_{bd}\,(U'\,Q_{iijj} + V'\,Q_{ijij} + W'\,Q_{ijji}) + \cdots$


is satisfied. 

This equation holds regardless of the values of the constants, provided all 
four subscripts are distinct, in virtue of (5) and the similar equations. To see 
what conditions must be satisfied when two subscripts become equal, note that 
the equations


(7) $x_1' = x_1\cos\theta - x_2\sin\theta, \qquad x_2' = x_1\sin\theta + x_2\cos\theta, \qquad x_i' = x_i \quad (i \neq 1, 2)$



define an admissible transformation, and therefore our set of equations will
hold after this transformation is applied. On applying it to (5), we obtain an
equation in tan θ true for all values of θ. Consequently the coefficients must
vanish, giving the new equation


(8) $Q_{1134} = Q_{2234}.$
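
The step leading to (8) may be written out as follows. This is a sketch only: the sense of rotation in the 1,2-plane is an assumption made here, and the result does not depend on it. It uses nothing beyond the vanishing of every component of Q with four distinct subscripts.

```latex
% Assumed rotation: x_1' = c x_1 - s x_2,  x_2' = s x_1 + c x_2,
% with c = \cos\theta, s = \sin\theta; all other coordinates unchanged.
\begin{align*}
Q'_{1234} &= c^2\,Q_{1234} + cs\,(Q_{1134} - Q_{2234}) - s^2\,Q_{2134} \\
          &= cs\,(Q_{1134} - Q_{2234})
             \qquad \text{since } Q_{1234} = Q_{2134} = 0 .
\end{align*}
% Q'_{1234} must vanish for every \theta, which forces Q_{1134} = Q_{2234}.
```

Dividing by $\cos^2\theta$ turns the first line into a polynomial in tan θ, which is the form referred to in the text.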


On applying the rotation similar to (7), but involving the 1- and 3-axes, to
equation (8), which likewise holds for all systems of coördinates, multiplying
the right side by cos²θ + sin²θ to make the equation homogeneous, equating
the coefficients to zero, and using (5), (8) and the similar equations, we find


(9) $Q_{1114} = Q_{3314} + Q_{3134} + Q_{1334}.$


If we now set abcd = 1123 in (6), and make use of (8), (9) and the similar
equations, we obtain 


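(10) $Q_{1123} = Q_{1123}\,(nA + D + E + I + L) + Q_{1132}\,(nB + C + F + J + K)$
$\qquad + Q_{1231}\,(B + nC + E + H + K) + Q_{1321}\,(A + nD + F + G + L)$
$\qquad + Q_{1213}\,(A + C + nE + H + I) + Q_{1312}\,(B + D + nF + G + J)$
$\qquad + Q_{2311}\,(D + F + nG + I + K) + Q_{3211}\,(C + E + nH + J + L)$
$\qquad + Q_{2113}\,(A + E + G + nI + K) + Q_{3112}\,(B + F + H + nJ + L)$
$\qquad + Q_{2131}\,(B + C + G + I + nK) + Q_{3121}\,(A + D + H + J + nL).$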


The equations obtained by making this an identity in the Q’s and equating
coefficients have a determinant equal to $n^3 (n-2)^5 (n+4)(n+2)^3$, which
is different from zero (since n ≥ 4, to make this part of the argument
necessary). Hence the equations have a solution, and when they are solved and
the result is substituted in (6) it will hold for abcd = 1123. It will also 
hold for any choice of the subscripts making the first two equal, and the 
remaining two distinct from these and from each other, as is evident from the 
equations determining the constants. 
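
One way to check the value of the determinant is sketched below. It rests on an assumption about the structure of the twelve equations, stated in the comments and not taken from the text: in the equation belonging to a given arrangement of the subscripts, the constant attached to that arrangement enters with coefficient n, and the four constants whose terms have c, or d, in the same place enter with coefficient 1. The labels M, P_c, P_d, p, q, f and u are introduced here for illustration.

```latex
% Label each of the twelve constants by the ordered pair (p, q), p \neq q,
% of places occupied by c and d among the four subscripts.
% Assumed matrix of the system:  M = (n-2)\,I + P_c + P_d ,
%   (P_c f)(p,q) = \sum_{q' \neq p} f(p,q'), \qquad (P_d f)(p,q) = \sum_{p' \neq q} f(p',q).
\begin{align*}
f = \text{const}:                      &\quad (P_c + P_d) f = 6 f  && \text{(multiplicity 1)}, \\
f = u(p) + u(q),\ \textstyle\sum u = 0: &\quad (P_c + P_d) f = 2 f  && \text{(multiplicity 3)}, \\
f = u(p) - u(q),\ \textstyle\sum u = 0: &\quad (P_c + P_d) f = 4 f  && \text{(multiplicity 3)}, \\
\text{all row and column sums of } f \text{ zero}:
                                       &\quad (P_c + P_d) f = 0    && \text{(multiplicity 5)},
\end{align*}
\[
\det M = (n+4)\; n^3\; (n+2)^3\; (n-2)^5 .
\]
```

On this reading the determinant vanishes, for positive n, only when n = 2, so that the restriction n ≥ 4 is more than sufficient.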

In an entirely analogous way, we may determine the values of A′, B′, ..., L′
so that the equation (6) will hold when a and c are equal, b and d being distinct
from these and from each other; and then A″, ..., A‴, ..., $A^{\mathrm{vi}}$, ... so that
the equation will hold when any pair of subscripts are equal, the remaining



pair being distinct from these and each other. It then follows from (9) and 
the similar equations that (6) holds when three of the subscripts are equal, 
but different from the fourth. 

We now define a tensor obtained from $Q_{abcd}$ by subtracting the terms already
determined:


(11) $S_{abcd} = Q_{abcd} - \delta_{ab}\,A\,Q_{iicd} - \cdots .$


From the method we used to determine the constants appearing in this equation,
it follows that all the components of $S_{abcd}$ containing one of the subscripts an
odd number of times vanish. Furthermore, by contracting (11) we may express
the contracted S’s in terms of the contracted Q’s. Consequently we may find
U, V, W etc. to satisfy (6) provided we can find constants which satisfy


(12) $S_{abcd} = \delta_{ab}\,\delta_{cd}\,(X\,S_{iijj} + Y\,S_{ijij} + Z\,S_{ijji}) + \cdots .$

We already know that the equations

(13) $S_{1234} = 0,$
(14) $S_{1123} = 0,$
(15) $S_{1114} = 0$
hold, as well as the similar equations obtained from them by permuting the 


subscripts. On applying the rotation similar to (7) involving the 2- and 3-axes
to (14), and noting that we have an identity in θ, we find


(16) $S_{1122} = S_{1133};$
and hence 


(17) == 


Again, by applying the rotation involving the 1- and 4-axes, and using the 
equations already set down, we find 


(18) $S_{1111} = S_{1144} + S_{1441} + S_{1414},$
and hence, by (16), 
(19) = 




If we now set abcd = 1122 in (12) and utilize the relations just derived,
we find that


(20) $S_{1122} = S_{1122}\,(n^2 X + nY + nZ) + S_{1212}\,(nX + n^2 Y + nZ) + S_{1221}\,(nX + nY + n^2 Z).$


The equations obtained by considering this equation an identity in S have
a determinant equal to $n^3 (n+2)(n-1)^2$, different from zero (n ≥ 2, since
we have two distinct subscripts). These equations may thus be solved for X,
Y and Z. Similarly we determine X′, Y′, Z′ so that (12) holds for $S_{1212}$, and
X″, Y″, Z″ so that it holds for $S_{1221}$. It then follows from the way these
coefficients are determined that equation (12) holds for any component whose
subscripts form two pairs of equal elements, all four not being the same. This
last case, however, is covered by recalling (18) and noticing the form of (12).
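
The computation behind (20) and its determinant may be sketched as follows. Two facts are assumed from the relations already obtained: the components $S_{aabb}$, $S_{abab}$, $S_{abba}$ (a ≠ b) do not depend on the choice of a and b, and $S_{1111} = S_{1122} + S_{1212} + S_{1221}$.

```latex
% Contracted sums, counting n(n-1) terms with i \neq j and n terms with i = j:
\begin{align*}
S_{iijj} &= n(n-1)\,S_{1122} + n\,S_{1111} = n^2 S_{1122} + n\,S_{1212} + n\,S_{1221}, \\
S_{ijij} &= n\,S_{1122} + n^2 S_{1212} + n\,S_{1221}, \\
S_{ijji} &= n\,S_{1122} + n\,S_{1212} + n^2 S_{1221}.
\end{align*}
% Making S_{1122} = X S_{iijj} + Y S_{ijij} + Z S_{ijji} an identity in the S's:
\begin{align*}
n^2 X + nY + nZ = 1, \qquad nX + n^2 Y + nZ = 0, \qquad nX + nY + n^2 Z = 0,
\end{align*}
\[
\begin{vmatrix} n^2 & n & n \\ n & n^2 & n \\ n & n & n^2 \end{vmatrix}
= n^3 \begin{vmatrix} n & 1 & 1 \\ 1 & n & 1 \\ 1 & 1 & n \end{vmatrix}
= n^3 (n+2)(n-1)^2 ,
\qquad
X = \frac{n+1}{n(n+2)(n-1)}, \quad Y = Z = -\frac{1}{n(n+2)(n-1)} .
\]
```

Substituting these values back gives $n^2 X + nY + nZ = (n^2 + n - 2)/((n+2)(n-1)) = 1$, as required.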

Having thus determined the coefficients to satisfy (12), which is now true
for all subscripts, we combine (12), (11) and the equations obtained by
contracting (11) so as to get an equation of the form (6) which is true for all
subscripts. On eliminating $Q_{abcd}$ from this equation by means of (4) we obtain
a tensor equation in $P_{abcd}$ which is implied by our original set. If there are
any other equations in the set in four distinct subscripts, which are not
consequences of the tensor equation just obtained, we may apply the process
again to get an additional tensor equation in $P_{abcd}$. We may keep this up
until the tensor equations obtained have as consequences all the equations of
our set with four distinct subscripts. This will have to happen after at most 4!
such tensor equations have been obtained, since from this number we could
solve for all the components $P_{1234}$, $P_{1342}$, etc., and eliminate them from our
original set.

When we have obtained these tensor equations, and removed from our 
original set of equations all those which are consequences of the tensor equa- 
tions, we shall have a second set of equations. This residual set has the 
property that no member contains four distinct subscripts. If one of these 
equations contains one, and hence two indices (say 2,3) each an odd number 
of times, by applying the lemma we may obtain from it an equation in which 
every term contains these indices an odd number of times, and hence of the 
form 


(21) $m_{1123}\,P_{1123} + \cdots = 0,$


where the m’s are mere numbers, coefficients of the components indicated by
their subscripts.




On applying the rotation similar to (7) involving the 1- and 4-axes, to
(21) and equating the coefficient of sin θ to 0, we find


(22) $(m_{4423} - m_{1123})\,P_{1423} + \text{terms in less than 4 subscripts} = 0.$
If the coefficient of $P_{1423}$ is not zero, the equation


(23) $P_{1423} = 0,$
and hence (cf. (8))
(24) $P_{4423} = P_{1123}$


will be included in the consequences of our tensor equations derived from
those involving four subscripts, and hence in view of (24) we can make


(25) $m_{4423} = m_{1123}$


in all cases. 
If we next apply (7), involving the 1- and 2-axes, to (21), we find, as the
condition for the coefficient of sin θ cos²θ vanishing,


(26) (12293 —— 4123 — M218) + Megas Praag = 


As this is of type (21), 1 and 3 being the odd subscripts, we obtain the 
relation, analogous to (25), 


(27) M4123 — Megas; 
or, in view of the relations similar to (25), 
(28) M2993 == Mai13 + 
In consequence of equations (25), (28) and similar equations, we may write 


(21) in the form 


(29) $m_{1123}\,P_{ii23} + m_{1231}\,P_{i23i} + \cdots = 0,$
which shows that the tensor


(30) $T_{ab} = m_{1123}\,P_{iiab} + m_{1231}\,P_{iabi} + \cdots$




has all its components in two distinct subscripts zero, and by the method used
above for $Q_{abcd}$, we may show that $T_{ab}$ satisfies the tensor equation


(31) $T_{ab} = \frac{1}{n}\,\delta_{ab}\,T_{ii}.$


On eliminating $T_{ab}$ from (31) by means of (30) we obtain a tensor equation
in $P_{abcd}$. In the same way we obtain all possible tensor equations which result
from the equations of our set with two distinct subscripts. We then reject all
equations from the set which are consequences of any of the tensor equations
so far obtained. The equations which remain, since they involve no subscript
an odd number of times, must be of the form


(32) Prise man Praia = 0. 


On applying the rotation similar to (7) involving the 1- and 3-axes, we find,
as the coefficient of sin θ cos³θ,


(33) (122 — Magee) + — Mgi13 — — M138) Pris + = 0. 


By an argument similar to that used above to establish (25) we may show 
that the coefficients here either are zero, or can be made zero, giving 


(34) — 

(35) M11 = + + 
Consequently we may write (32) in the form 

(36) + Mon Pit + --- +M = O,z 


M being a constant term. This is a tensor equation, and by obtaining all such 
equations from our set we will finally have a collection of tensor equations 
which is equivalent to our set, in the sense that it implies and is implied by 
the set. These equations are either of type (6), (31) using (30), or (36). All 
these may be included in a single form, 


(37) $A\,P_{abcd} + B\,\delta_{ab}\,P_{iicd} + C\,\delta_{ab}\,\delta_{cd}\,P_{iijj} + \cdots + \delta_{ab}\,\delta_{cd}\,D = 0.$




The coefficients in this equation are constants at the point under
consideration, and the equation holds for normal coördinates at this point. As it is
evidently equivalent to the equation in general coördinates


(38) $A\,P_{abcd} + B\,g_{ab}\,P_{iicd} + C\,g_{ab}\,g_{cd}\,P_{iijj} + \cdots + g_{ab}\,g_{cd}\,D = 0,$


this may now be considered to hold at all points, if instead of regarding the 
coefficients as constants, we regard them as scalar point functions. 

As the argument given above for tensors of the fourth order may evidently be 
extended to those of any order, the mth, the process consisting in first deducing 
tensor equations from those in the set in m distinct subscripts, then m—2, 
and so on, we may state as the conclusion of this section 

THEOREM I. An invariant set of equations, linear in the components of 
a single tensor, is equivalent to a set of tensor equations, obtained by equating 
to zero linear combinations of the given tensor, those obtained from it by per- 
muting the subscripts, and those obtained by contracting one or more times and 
multiplying by the fundamental quadratic tensor so as to bring the order to its 
original value. The coefficients in the linear relations are scalar point functions. 


3. EXTENSION TO THE GENERAL CASE 


The previous section merely dealt with equations linear in the components 
of a single tensor; it is, however, easy to extend the result there obtained to 
the case of equations formed from several tensors, differentiation and multi- 
plication being admitted. This extension now concerns us. 

Consider first the case where the equations are linear in the components of
several tensors, not necessarily of the same order, so that each term involves
only a component of one of the tensors. We confine our attention to one point,
and introduce normal coördinates. The lemma of the preceding section
evidently applies here, and by its use we may reduce our set to one in which
each equation contains any one subscript an odd number of times, or an even
number of times. Hence the tensors appearing in these equations must have
orders differing by an even number, and they may all be brought up to the
same order by introducing δ’s with equal numerical subscripts, which does not
affect the invariant character of the set. Having done this, we may now select
the equation of the set containing the greatest number of distinct subscripts,
as we did in the preceding paragraph, and proceed to deduce tensor equations
which follow from our set, exactly as before.

Next, if the equations of the set express the vanishing of expressions in- 
volving the components of several tensors to a degree higher than the first, 
being polynomials in these components, we have merely to regard the product 




of two components of the same, or of two different tensors, as a single com- 
ponent of a tensor whose order is the sum of the orders of the two used in 
forming it, to reduce this to the case just discussed. 

Finally we consider the case where differentiation of the tensor components
is permitted. We notice that we are dealing with normal coördinates in
carrying out our reduction, and in these coördinates the first derivatives may
be replaced by the corresponding covariant derivatives, while the higher
derivatives may be replaced by polynomials in the higher covariant derivatives,
and the derivatives of the Christoffel symbols in normal coördinates, evaluated
at the origin. But these quantities are all tensors,* and thus the set of
equations implies a set holding in normal coördinates which may be treated by
methods already given.

As an illustration of the reduction of the last three paragraphs, suppose 
one of our original equations were 


(39) (Wie)? = + (Nig) (07, /8 29). 


By applying the lemma, we should obtain, after replacing the derivative by 
a covariant derivative, 
(Vie) (Tie) = 0, 
(40) 
— = 0, 


and on introducing the tensors defined by 


Qabed Van Teja: 
(41) 


Qideder = Wap Wea We Dee Oar Ua, 
we should obtain 
= U, 
(42) 
== 0, 


a form to which the result of the preceding section would apply. 

We have thus proved the theorem stated in the introduction: 

THEOREM II. An invariant set of equations, obtained by equating to zero
expressions formed from one or more given tensors and point functions by 


* For the expression of the derivatives of the Christoffel symbols at the origin of a system
of normal coördinates in terms of the curvature tensor, see O. Veblen, Proceedings of
the National Academy of Sciences, vol. 8 (1922), p. 196.




addition, multiplication, and differentiation with respect to the coördinates, is
equivalent to a set of tensor equations. 


4. EQUATIONS LINEAR IN THE SECOND DERIVATIVES OF THE 
FUNDAMENTAL QUADRATIC TENSOR 

In this section we shall consider the classification of invariant sets of
equations, formed by equating to zero expressions involving the fundamental
quadratic tensor, $g_{ij}$, its first and second derivatives, and these last linearly.
Such equations are of interest, since the equations holding in space free of
matter are of this form, so that on specializing our results to four-space they
will throw light on the choice of equations for the relativistic theory of
gravitation. The question is related to one concerning possible tensors of
this type, which was previously taken up by the author.* The results of that
paper, while related to those here obtained, neither follow from them nor
lead to them.

As our invariant set of equations involve merely the $g_{ij}$, and their first and
second derivatives, if we introduce normal coördinates, for which (1) holds,
they will reduce to expressions in the second derivatives, and as these (in
normal coördinates) are expressible in terms of the curvature tensor, we see
that our equations involve this tensor only. Also, on account of the linearity
requirement, they involve its components linearly, so that they are the type
discussed in Section 2. By that section, we see that our equations must all be
of the form

(43) $A\,R_{abcd} + B\,R_{adbc} + C\,g_{ab}R_{cd} + D\,g_{cd}R_{ab} + E\,g_{ac}R_{bd} + F\,g_{bd}R_{ac}$
$\qquad + H\,g_{ad}R_{bc} + I\,g_{bc}R_{ad} + J\,g_{ab}g_{cd}R + K\,g_{ac}g_{bd}R$
$\qquad + L\,g_{ad}g_{bc}R + M\,g_{ab}g_{cd} + N\,g_{ac}g_{bd} + P\,g_{ad}g_{bc} = 0.$


The terms in (38) omitted in this equation depend on the ones kept, owing to
the symmetry relations of $R_{abcd}$, and we use the customary notation for the
tensors obtained from it by contraction.

As we may be able to factor out the fundamental quadratic tensor from (43),
in case every term contains a $g_{ab}$, say (i.e., if all the coefficients were zero
except C, J, M), the factoring being accomplished by contracting with respect
to a and b, the equation may reduce to one of the second order, or to a scalar
relation. We shall treat these simpler cases first. The scalar relation will
evidently reduce to the form
(44) $R = M,$


M being a scalar, perhaps zero.


* Philosophical Magazine, vol. 45 (1923), p. 998 ff. Cf. H. Weyl, Raum, Zeit, Materie,
fourth edition, p. 287.




If the equation reduces to one of the second order, it will be of the form
(45) $C\,R_{cd} + J\,g_{cd}\,R + M\,g_{cd} = 0.$
On contracting this with respect to c and d, we obtain
(46) $(C + nJ)\,R + nM = 0,$


n being the dimensionality of the space. If C is zero, this either gives an
equation of type (44), having (45) as a consequence, or all the constants are
zero and (45) is satisfied identically. If C ≠ 0, on replacing JR + M by
−CR/n in (45), and dividing out C, we find


(47) $R_{cd} - \frac{1}{n}\,g_{cd}\,R = 0$


as the standard form for a tensor equation of the second order.
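
The reduction to (47) may be written out in two lines. The sketch assumes that (45) reads $C R_{cd} + J g_{cd} R + M g_{cd} = 0$, with contraction performed by $g^{cd}$ so that $g^{cd} g_{cd} = n$ and $g^{cd} R_{cd} = R$.

```latex
% Contract the assumed form of (45) on c and d, then substitute back:
\begin{align*}
C\,R + nJ\,R + nM = 0
  &\;\Longrightarrow\; J\,R + M = -\frac{C}{n}\,R, \\
C\,R_{cd} + g_{cd}\,(J\,R + M)
  &= C\left(R_{cd} - \frac{1}{n}\,g_{cd}\,R\right) = 0 .
\end{align*}
```

Since C ≠ 0, the factor C may be removed, and the bracket alone must vanish.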

We may show that if, in (43), both A and B are zero, that equation is
equivalent to one of type (44) or (47). For in that case, on contracting with
respect to a and b, we would obtain


(48) $(nC + E + F + H + I)\,R_{cd} + (D + nJ + K + L)\,g_{cd}\,R + (nM + N + P)\,g_{cd} = 0.$


If the coefficient of $R_{cd}$ in (48) is different from zero, it leads to an equation
of type (47), by means of which we may eliminate all the R’s with two
subscripts from (43), the resulting equation easily reducing to a relation like (44).
As these two equations of our earlier types would have (43) as a consequence,
nothing new results from this case. If the coefficient of $R_{cd}$ in (48) were zero,
we could contract (43) with respect to a second pair of subscripts and carry
out the argument as before, unless all six coefficients were zero, in which case
we would have C = D = E = F = H = I = 0, and (43) would be
essentially a scalar equation like (44).
When A and B are not both zero, by using the relation


(49) $R_{abcd} + R_{adbc} + R_{acdb} = 0$

we can obtain a relation involving only one of these. For, since (43) is true
for all coördinates, and hence in particular when we permute the subscripts,
it gives, on interchanging b and c,


(50) $A\,R_{acbd} + B\,R_{adcb} + \cdots = 0,$




or, using the symmetry properties of $R_{abcd}$,

(51) $A\,R_{acbd} - B\,R_{adbc} + \cdots = 0.$

On subtracting this equation from (43), and using (49), we find
(52) $(2B - A)\,R_{adbc} + \cdots = 0.$


If the coefficient is zero, we interchange the rôles of A and B in the argument,
and as they are not both zero, we see that in all cases an equation of form (43)
is obtained containing a single R with four subscripts, with non-vanishing
coefficient. Equation (52) may thus be solved for $R_{adbc}$, and if we use this
value in the left member of the identity


(53) Reade Roeaca Ravea Ravae 4 Ravea 


which follows from the symmetry properties of $R_{abcd}$, the resulting equation
takes the form

(54) $A\,R_{abcd} + B\,(g_{ac}R_{bd} + g_{bd}R_{ac} - g_{ad}R_{bc} - g_{bc}R_{ad})$
$\qquad + C\,(g_{ac}g_{bd} - g_{ad}g_{bc})\,R + D\,(g_{ac}g_{bd} - g_{ad}g_{bc}) = 0.$


Furthermore, (43) can not imply anything more than (54) unless perhaps
equations of our earlier types, since we can eliminate all the terms with R’s
of four subscripts from (43) by using (54), coming back to the case where
A and B in (43) are both zero.

On contracting (54) with respect to a and d, we find


(55) $(A + (2-n)B)\,R_{bc} + (-B + (1-n)C)\,g_{bc}\,R + D\,(1-n)\,g_{bc} = 0.$

If the first coefficient here is not zero, we may express $R_{bc}$ in terms of $g_{bc}R$
and $g_{bc}$, and consequently modify B at pleasure in (54), provided we make
corresponding changes in C and D. Thus we may assume in all cases

(56) $A + (2-n)B = 0.$

Similarly, if the second coefficient is not zero, we may, by a second contraction,
express R as a scalar, and hence change C in (54) by making the proper
changes in D, so as to get in all cases


(57) $-B + (1-n)C = 0.$




Finally, when these changes have been made, if necessary, (55) gives by
contraction


(58) $D = 0.$


Solving (56), (57) and (58) for the constants in terms of A, and then dividing
out by A, which is not zero, we reduce (54) to the form


(59) $R_{abcd} + \frac{1}{n-2}\,(g_{ac}R_{bd} + g_{bd}R_{ac} - g_{ad}R_{bc} - g_{bc}R_{ad}) - \frac{1}{(n-2)(n-1)}\,(g_{ac}g_{bd} - g_{ad}g_{bc})\,R = 0.$

In case any of the transformations of constants were necessary to make the
coefficients of (55) vanish, (54) implies, in addition to (59), equations of type
(44) and (47), but nothing further in any case.
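
The solution of (56), (57) and (58) may be set out as follows. The sketch assumes n > 2, and reads the last condition as the vanishing of the constant term $D(1-n)$ left in (55) once (56) and (57) hold.

```latex
\begin{align*}
A + (2-n)B = 0  &\;\Longrightarrow\; B = \frac{A}{n-2}, \\
-B + (1-n)C = 0 &\;\Longrightarrow\; C = -\frac{B}{n-1} = -\frac{A}{(n-2)(n-1)}, \\
D\,(1-n) = 0    &\;\Longrightarrow\; D = 0 .
\end{align*}
% With these values, contracting the resulting equation on a and d gives 0 = 0:
% the coefficients A + (2-n)B and -B + (1-n)C of R_{bc} and g_{bc} R both vanish.
```

The final comment records a check on the result: the left member of the reduced equation has no non-vanishing contraction on a and d.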

Before considering the possibilities of combining these three types of
equation, we shall prove that an equation of type (47) always leads to a special
case of one of type (44), that in which the right member is a constant.* If (47)
holds, we have


(60) $R^i_{k,i} - \frac{1}{n}\,(R\,g^i_k)_{,i} = 0,$


denoting covariant derivatives in the usual way. On the other hand, it is
well known, and easily proved by calculation in geodesic coördinates, that


(61) $R^i_{k,i} = \frac{1}{2}\,(R\,g^i_k)_{,i}$


identically, which shows that, unless n = 2,

(62) $(R\,g^i_k)_{,i} = \partial R/\partial x_k = 0, \qquad R = R_0.$


When n = 2, (47) holds identically, so that from now on we need only
consider the combination of (62) and (47).
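
The argument of this paragraph may be condensed into one line. The sketch reads (47) in mixed form as $R^i_k = \frac{1}{n} R\,g^i_k$ and uses the contracted Bianchi identity; the comma denotes covariant differentiation, and $g^i_k$ has zero covariant derivative.

```latex
% Divergence of the mixed form of (47), compared with the contracted Bianchi identity:
\begin{align*}
R^i_{k,i} = \frac{1}{n}\,\frac{\partial R}{\partial x_k}
\quad\text{and}\quad
R^i_{k,i} = \frac{1}{2}\,\frac{\partial R}{\partial x_k}
\;\Longrightarrow\;
\left(\frac{1}{2} - \frac{1}{n}\right)\frac{\partial R}{\partial x_k} = 0 .
\end{align*}
% Hence \partial R / \partial x_k = 0, i.e. R is a constant, whenever n \neq 2.
```

For n = 2 the factor in parentheses vanishes and no condition on R results, in agreement with the remark that (47) then holds identically.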

In view of the preceding results, we see that all sets of equations of the 
kind discussed in this section are equivalent to one of the following five 
tensor equations: 


* G. Herglotz, Leipziger Berichte, vol. 68 (1916), p. 203; cf. also G. D. Birkhoff,
loc. cit., p. 220.


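(63)

(I) $R = M,$

(II) $R_{ab} - \frac{1}{n}\,g_{ab}\,R_0 = 0,$

(III) $R_{abcd} + \frac{1}{n-2}\,(g_{ac}R_{bd} + g_{bd}R_{ac} - g_{ad}R_{bc} - g_{bc}R_{ad}) - \frac{1}{(n-2)(n-1)}\,(g_{ac}g_{bd} - g_{ad}g_{bc})\,R = 0,$

(IV) = (III), (II) $R_{abcd} + \frac{1}{n(n-1)}\,(g_{ac}g_{bd} - g_{ad}g_{bc})\,R_0 = 0,$

(V) = (III), (I) $R_{abcd} + \frac{1}{n-2}\,(g_{ac}R_{bd} + g_{bd}R_{ac} - g_{ad}R_{bc} - g_{bc}R_{ad}) - \frac{1}{(n-2)(n-1)}\,(g_{ac}g_{bd} - g_{ad}g_{bc})\,M = 0.$
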
Here, as indicated, (IV) is equivalent to (III) and (II); while (V) is equivalent
to (III) and (I). It is evident that (IV) follows from (III) and (II), and to
deduce these from (IV) we have merely to contract (IV), obtaining (II), which
then enables us to reduce (IV) to (III). We may handle (V) similarly. Since
we have shown that (II) always implies the special case of (I)


(64) $R = R_0,$


these are all the combinations that we need consider. It should be particularly
observed in the above that $R_0$ is a numerical constant, while M is a scalar
point function. Thus there is always some value of M for which (I) holds; it
only becomes a condition when the form of M is given.
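
The passage from (III) and (II) to (IV) may be checked as follows. The sketch reads (II) as $R_{ab} = \frac{1}{n} R_0\,g_{ab}$, so that $R = R_0$, and abbreviates $G_{abcd} = g_{ac}g_{bd} - g_{ad}g_{bc}$, a label introduced here for illustration.

```latex
% Substitute R_{ab} = (R_0/n) g_{ab} and R = R_0 in the left member of (III):
\begin{align*}
g_{ac}R_{bd} + g_{bd}R_{ac} - g_{ad}R_{bc} - g_{bc}R_{ad}
  &= \frac{2R_0}{n}\,G_{abcd}, \\
\frac{1}{n-2}\cdot\frac{2R_0}{n} - \frac{R_0}{(n-2)(n-1)}
  &= \frac{R_0\,[\,2(n-1) - n\,]}{n(n-1)(n-2)} = \frac{R_0}{n(n-1)}, \\
R_{abcd} + \frac{R_0}{n(n-1)}\,G_{abcd} &= 0 .
\end{align*}
```

Contracting the last line on a and d returns $R_{bc} - \frac{1}{n} R_0\,g_{bc} = 0$, which is the converse step described in the text.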

The equations above given may be interpreted geometrically. (I) states the
value of the total curvature, and evidently only derives special significance
when M is specialized. (II) is the condition that the “principal directions”
become indeterminate at every point of the space.* (III) is a necessary and
sufficient condition (n > 3) that the space be conformally representable on


* L. P. Eisenhart, Proceedings of the National Academy of Sciences, vol. 8
(1922), p. 24.






a euclidean space.* Its left member occurs in various investigations on
conformal representation, and has been called the “conform curvature”. (IV) is
the condition that the space be “spherical”.†

As stated previously, the equations holding in space time which are the 
analogues of Laplace’s equation in the Newtonian theory of gravitation for 
the theory of Einstein must be of the type under discussion. Thus they must 
be one of the equations given in (63). Since (I) is not restrictive enough, 
and (III) is too restrictive, it follows that the only possible equation is (II),
that selected by Einstein. 

Recapitulating the work of this section, we have proved 

THEOREM III. Every set of invariant equations formed by equating to zero
expressions involving the fundamental quadratic tensor, $g_{ij}$, its first and second
derivatives, and these last linearly, is equivalent to a tensor equation of one of
the five types given in (63) above.


* H. Weyl, Mathematische Zeitschrift, vol. 2 (1918), p. 404; J. A. Schouten,
Mathematische Zeitschrift, vol. 11 (1921), pp. 58 ff.; cf. also H. W. Brinkmann,
Proceedings of the National Academy of Sciences, vol. 9 (1923), p. 1, p. 172.

† Schouten, loc. cit., p. 75.


HARVARD UNIVERSITY, 
CAMBRIDGE, MASS. 


SOME PROPERTIES OF SPHERICAL CURVES, WITH 
APPLICATIONS TO THE GYROSCOPE* 


BY 


O. D. KELLOGG


In his paper entitled On the gyroscope†, Professor Osgood has reduced to
the utmost simplicity certain important aspects of the theory of the motion 
of a rigid body which is dynamically symmetric about an axis through the 
center of mass. The present study derives its inspiration from his article. 
A fundamental rôle is there played by the geodesic curvature of the trace
on the unit sphere with center at the fixed point, or at the center of mass, 
of the point where this sphere is cut by the axis of symmetry, and this 
fact suggests that it might be desirable to have on hand more information 
about the intrinsic properties of spherical curves. 

Accordingly, the first part of the present paper is devoted to the development 
of a number of such properties. The subject, however, is not without interest 
for its own sake, and suggests numerous extensions and related problems; for 
instance, a systematic study of the relationships of certain properties of the 
curvature as a function of the length of arc with the corresponding geometric
properties of the curve. One such property which has already received
a good deal of attention is the “four vertex theorem” (Vierscheitelsatz)
for plane ovals‡. Among other questions that might be raised are such
simple ones as the following: what functional character of the curvature, 
in addition to periodicity, insures a closed curve on the sphere? When is 
a plane, or spherical curve, asymptotic to some closed curve? What inter- 
esting comparison theorems are there for pairs of curves whose curvatures 
stand in simple relationships of equality or inequality? And there is the 
further question to be considered of curves on more general surfaces. 

The second part of the paper makes application of the results of the first 
part to the theory of the gyroscope. While some apparently new facts are 
there brought to light, mention should be made of a new way of establishing 


* Presented to the Society, April 28, 1923. 

† These Transactions, vol. 23 (1922), pp. 240-264. This paper will be referred to
hereafter by the initial O. 

‡ See, for instance, Blaschke, Vorlesungen über Differentialgeometrie, vol. I, 1921, p. 16.




502 O. D. KELLOGG [October


some classical results. It will be recalled* that certain theorems on the 
sense of precession of the gyroscope, and inequalities on the longitudinal 
motion of the spherical pendulum, have hitherto required the use of the theory 
of elliptic functions, or of Cauchy’s integral theorem, for their establishment, 
while others depend simply on the appraisal (Abschätzung) of definite integrals.
It turns out that exactly those results which have previously required
the less elementary methods are simple consequences of our present geometric 
theorems. 

In addition to the applications mentioned, there will be found in the 
second part of the paper certain further results and formulas which may 
well be of use in connection with problems on the gyroscope. 


PART I. ON CERTAIN INTRINSIC PROPERTIES OF CURVES 


1. Plane curves whose curvatures approach limits as the arc
lengths increase indefinitely. The need of care in the study of qualitative
properties of curves is illustrated by a loose statement in the authorized
German translation of Cesàro’s excellent book on intrinsic geometry.† In the
discussion of plane curves the following sentences appear, in which ρ is the
radius of curvature, and φ the angle between a fixed line and the tangent at
the point corresponding to the arc length s: “Nur wenn s zusammen mit φ
unbegrenzt zunimmt, kann es vorkommen, daß ρ sich einem von Null verschiedenen
Grenzwerte a nähert. Alsdann windet sich die Kurve, anstatt sich
um einen Punkt herumzuwickeln, asymptotisch um einen Kreis vom Radius a,
und zwar innerhalb oder außerhalb desselben, je nachdem der absolute Betrag
von ρ sich oberhalb oder unterhalb seines Grenzwertes hält.” [Only when s
increases without bound together with φ can it happen that ρ approaches a
limit a different from zero. The curve then winds, instead of coiling about
a point, asymptotically about a circle of radius a, inside or outside it
according as the absolute value of ρ stays above or below its limit.] The following
example shows that from the knowledge that ρ approaches a limit as s becomes
infinite we cannot infer that an asymptotic circle exists.

Let s_n denote the sum of the first n terms of the harmonic series, s_n = 1
+ 1/2 + 1/3 + ··· + 1/n. We describe about each point (s_n, 0) of the
x, y-plane, a circle of radius a. We then erase the upper halves of these
circles, and unite the lower halves to a single continuous curve, with continuous
curvature, by joining the two most distant extremities of each pair of successive
semicircles by an arch of an ellipse with transverse axis along the x-axis,
so chosen as to make the curvature continuous. Thus, the first arch has its
major axis terminating in the points (1 − a, 0) and (3/2 + a, 0), the second,
in (3/2 − a, 0) and (11/6 + a, 0), and so on.
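The behaviour of the centers in this example can be checked numerically. The following is a minimal sketch, not part of the original argument; the function name harmonic_centres and the cut-off of 10000 terms are assumptions made for illustration. It lists the abscissas of the centers and the gaps between successive centers. The gaps tend to 0, so that successive semicircles lie ever closer together, while the centers themselves increase without bound, so that no single circle can be asymptotic.

```python
def harmonic_centres(n_max):
    """Return [s_1, ..., s_n_max], the partial sums of the harmonic series.

    These are the abscissas of the centers of the circles in the example.
    """
    centres, total = [], 0.0
    for n in range(1, n_max + 1):
        total += 1.0 / n
        centres.append(total)
    return centres


centres = harmonic_centres(10000)  # the cut-off is an illustrative choice
# Gap between successive centers: s_(n+1) - s_n = 1/(n+1), which tends to 0.
gaps = [b - a for a, b in zip(centres, centres[1:])]

print("first centers:", centres[:3])
print("last gap:", gaps[-1])
print("last center:", centres[-1])
```

The last center keeps growing, roughly like the logarithm of the number of terms, if the cut-off is raised.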


* See O, p. 260, end of page, and Appell, Traité de Mécanique Rationnelle, vol. I, Paris,
1902, p. 501.
† Vorlesungen über natürliche Geometrie, Leipzig, 1901, p. 12.


1923] CURVATURE AND THE TOP 503 


Here the curvature is seen to approach the limit 1/a, but no asymptotic 
circle exists. The example can evidently be modified to bring out a number 
of different facts. For instance, the initial set of circles may be taken with 
their centers on a closed curve, say a large circle. We shall then have 
a bounded curve exhibiting the same lack of an asymptotic circle. 

A statement that can be made, however, is the following: 


THEOREM I. If a plane curve has curvature, K, which is a continuous
function of the arc length, s, and if, as s becomes infinite, K approaches
a limit different from 0 while always increasing or always decreasing, then the
curve has an asymptotic circle, approached from without, or within, respectively.

As the proof of this theorem is analogous to that of a later one on spherical 
curves (p. 509), it will not be given. Instead, we shall prove a theorem which 
is essentially broader in its hypothesis, and whose analogue for spherical 
curves is neither so simple of treatment, nor so interesting from the point of 
view of dynamics: 


THEOREM II. If there exist two constants, α > 0, and s_0, such that for
s_0 < s < ∞, K(s) > α, and if K(s) is of bounded variation, then the curve
defined by K = K(s) has an asymptotic circle.

A consideration of the total variation of K(s) makes it obvious
that K(s) must approach a limit, k. Moreover, if τ(s) is the angle
which the tangent to 𝒞, the curve under consideration, makes with a fixed
direction, then K is also of bounded variation when considered a function of τ
(τ(s) = ∫ K(s) ds). Now the coördinates, x(s), y(s), of P, on 𝒞, referred
to appropriate axes, are given by

\[ x = \int_0^{\tau} \frac{\cos\tau}{K(\tau)}\,d\tau, \qquad y = \int_0^{\tau} \frac{\sin\tau}{K(\tau)}\,d\tau. \]

From these formulas it is not difficult to show that the successive maxima and
minima of x and y approach limits, and that the mean of these limits for x,
and the mean for y, give the coördinates of the center of an asymptotic circle.

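The maxima, x″_n, of x occur for τ = (2n + 1/2)π, and the minima, x′_n, for
τ = (2n + 3/2)π. If the interval of integration in the expression for x″_n be
subdivided at these points, we may use the law of the mean in each sub-interval,
and write

\[ x''_n = \int_0^{(2n+\frac12)\pi} \frac{\cos\tau}{K(\tau)}\,d\tau
 = \int_0^{\frac12\pi} \frac{\cos\tau}{K(\tau)}\,d\tau
 + \sum_{i=0}^{2n-1} \int_{(i+\frac12)\pi}^{(i+\frac32)\pi} \frac{\cos\tau}{K(\tau)}\,d\tau \]
\[ = \frac{1}{K(\tau_0)} - 2\{[1/K(\tau_1) - 1/K(\tau_2)] + [1/K(\tau_3) - 1/K(\tau_4)] + \cdots + [1/K(\tau_{2n-1}) - 1/K(\tau_{2n})]\}, \]

where τ_0, τ_1, ..., τ_{2n} are appropriate mean values of τ.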

Now, since K(τ) > α > 0, 1/K(τ) is of bounded variation with K(τ), for
|1/K(b) − 1/K(a)| < |K(b) − K(a)|/α². Hence the terms in the braces are
the first n terms of an absolutely convergent series. Accordingly, x″_n approaches
a limit, x″. Similarly, x′_n, y″_n and y′_n approach limits, x′, y″, and y′. Let us
denote by a and b the means, (x′ + x″)/2 and (y′ + y″)/2, of these limits,
and by r the limit, 1/k, of 1/K(τ). Then, since ∫ cos τ/K(τ) dτ and ∫ sin τ/K(τ) dτ,
taken over any interval whose end points correspond to successive integral
multiples of π/2, approach r, or −r, according to the interval selected, we
see that the points of 𝒞 of maximum abscissas, maximum ordinates, minimum
abscissas, and minimum ordinates, approach (a + r, b), (a, b + r), (a − r, b)
and (a, b − r), respectively. Let τ_ε be a number, corresponding to a given
ε > 0, such that for τ > τ_ε, the differences between these four variables and
their limits are numerically less than ε, and also that |1/K(τ) − 1/k| < ε.
This means that for τ > τ_ε the extremes of the coördinates of 𝒞 differ by less
than ε from the corresponding extremes for the circle C: X = a + r sin τ,
Y = b − r cos τ. From this we infer that for any point of 𝒞 for τ > τ_ε,

\[ |x(\tau) - X(\tau)| < \varepsilon + \left| \int [1/K(\tau) - 1/k] \cos\tau \, d\tau \right| < 2\varepsilon, \]

where the interval of integration is from the greatest multiple of π/2 less than τ,
to τ. A similar inequality will hold for y(τ) − Y(τ). Thus 𝒞 ultimately
lies entirely in any region containing the circumference C in its interior;
that is, 𝒞 approaches C in the weak sense. It is easily shown, however, that
the approach has also the stronger sense, namely, that the difference of the
direction angles, τ and τ′, of 𝒞 and C, at two points, one on each curve, also



approaches 0 as τ increases indefinitely and the two points approach coincidence.
This is because of the constant curvature of C, which has as consequence
the finite distance apart of any pair of points at which the directions
of C differ by a finite amount. The required inequalities may readily be
supplied, and the proof of the theorem thus completed.
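Theorem II can be illustrated numerically. The sketch below is an illustration under stated assumptions and is not taken from the paper: the curvature K(s) = 1 + 1/(1 + s)² is an arbitrary choice which is bounded below by 1 and of bounded variation, the names K, tau, simpson and centre_estimate are hypothetical, and the integration is by the composite Simpson rule. It evaluates the center of the osculating circle, (x − sin τ/K, y + cos τ/K), at two late values of s; the two estimates nearly coincide, as they must if an asymptotic circle exists.

```python
import math


def K(s):
    """Assumed curvature: decreases from 2 to the limit k = 1."""
    return 1.0 + 1.0 / (1.0 + s) ** 2


def tau(s):
    """Tangent angle, tau(s) = integral of K from 0 to s, in closed form."""
    return s + 1.0 - 1.0 / (1.0 + s)


def simpson(f, a, b, n):
    """Composite Simpson rule with n (made even) sub-intervals."""
    if n % 2:
        n += 1
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3.0


def centre_estimate(s_end, steps_per_unit=200):
    """Center of the osculating circle of the curve at arc length s_end."""
    n = int(s_end * steps_per_unit)
    x = simpson(lambda s: math.cos(tau(s)), 0.0, s_end, n)
    y = simpson(lambda s: math.sin(tau(s)), 0.0, s_end, n)
    t, k = tau(s_end), K(s_end)
    return (x - math.sin(t) / k, y + math.cos(t) / k)


c1 = centre_estimate(200.0)
c2 = centre_estimate(400.0)
shift = math.hypot(c1[0] - c2[0], c1[1] - c2[1])
print("center at s = 200:", c1)
print("center at s = 400:", c2)
print("shift of the center:", shift)
```

The derivative of the center with respect to s has magnitude |K′|/K², so the shift between the two estimates is bounded by the variation of 1/K on the interval, which is the quantity on which the proof above depends.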

2. Spherical curves and spherical evolutes. The most convenient
analytical tool for the present study appears to be the vector. We shall denote
vectors by Clarendon letters, a, b, P, etc., and employ the Gibbs notation, a·b
for the scalar product, a × b for the vector product, and (a, b, c) = a·(b × c)
for the triple product. The vector algebra required goes little beyond the
distributive laws, and the expansion formula a × (b × c) = (a·c)b − (a·b)c.
Primes will be used for one purpose only, namely, to denote derivatives with
respect to the arc-length, s, of 𝒞. Unit vectors along the coördinate axes will
be denoted by i, j, k, and their senses we shall assume fixed once for all, and
so related to the definition of vector product that (i, j, k) = +1. The
magnitude of a vector will be denoted by the corresponding italic letter.

Let the curve 𝒞 lie on the unit sphere, S, and let a variable point of 𝒞 be
characterized by the unit position vector, P = xi + yj + zk, with origin at O,
the center of S. We shall restrict ourselves to curves such that x, y and z
have continuous derivatives with respect to s of the first three, and, in some
cases, the first four orders. From P are derived the tangent vector

\[ \mathbf{T} = \mathbf{P}' \tag{1} \]

and the curvature vector

\[ \mathbf{K} = \mathbf{T}' = \mathbf{P}''. \tag{2} \]

For these we have the relations

\[ \mathbf{P}^2 = 1, \tag{3} \]

since 𝒞 is on the unit sphere, and

\[ \mathbf{T}^2 = 1, \tag{4} \]

since (P′)² = (ds/ds)² = 1.
Furthermore, by differentiating these relations, we find

\[ \mathbf{P}\cdot\mathbf{T} = 0, \tag{5} \]

\[ \mathbf{T}\cdot\mathbf{K} = 0, \tag{6} \]




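and by further differentiation,

\[ \mathbf{P}\cdot\mathbf{K} = -\mathbf{T}^2 = -1, \tag{7} \]
\[ \mathbf{P}\cdot\mathbf{K}' = -\mathbf{T}\cdot\mathbf{K} = 0, \tag{8} \]
\[ \mathbf{T}\cdot\mathbf{K}' = -\mathbf{K}^2. \tag{9} \]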


The unit principal normal vector, N, of 𝒞 will coincide in direction with K.
We shall give it the same sense, so that

\[ \mathbf{N} = \mathbf{K}/K = \rho\,\mathbf{K}, \tag{10} \]

where K and ρ are the curvature and radius of curvature of 𝒞. Formula (7)
shows that the vector K always makes an obtuse angle with the position vector, and that
its magnitude, the curvature, is never less than 1.

The unit binormal vector is defined as

\[ \mathbf{B} = \mathbf{T}\times\mathbf{N}. \tag{11} \]

If the initial point of B be placed at O, its tip will mark a point, Q, on S
which is the spherical center of curvature of 𝒞 for the point P (the tip of P).
We define the quantity R, 0 < R < π, by the equations

\[ \sin R = \rho, \qquad \cos R = \mathbf{P}\cdot\mathbf{B}, \tag{12} \]

and call it the spherical radius of curvature. The same term will be used
occasionally for the great circle arc, QP, of which it is the length. If, with Q
as center, and with spherical radius R, a circle be described on the sphere,
this will be the osculating circle of 𝒞 at P.

As P moves along 𝒞, Q will, in general, trace a curve, ℰ, the spherical
evolute of 𝒞. The point diametrically opposite Q will trace a symmetric curve,
similarly related to 𝒞. But the particular evolute here defined is selected by
the sense given to the binormal vector. A reversal of the sense of increasing
s on 𝒞 would interchange the evolutes.

The Frenet formulas take the form

\[ \mathbf{T}' = \frac{\mathbf{N}}{\rho}, \qquad \mathbf{N}' = -\frac{\mathbf{T}}{\rho} + \frac{\mathbf{B}}{\tau}, \qquad \mathbf{B}' = -\frac{\mathbf{N}}{\tau}, \tag{13} \]




where 1/τ is the torsion of 𝒞. From the third Frenet formula, and equations
(12), (10) and (7), we see that the torsion is the negative of the derivative
with respect to s of the spherical radius of curvature,

\[ R' = -\frac{1}{\tau}. \tag{14} \]

The torsion may be positive or negative; if the axis system is a right hand
one, the curve, at a point where τ > 0, deviates from its osculating circle by
bending to the left as one follows 𝒞 in the sense of increasing s, with head in
the direction in which P points.
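The step from the Frenet formulas to (14) may be written out as follows. This is a sketch of one route, assuming the third Frenet formula in the form B′ = −N/τ quoted later in the proof of Theorem VIII; it is not claimed to be the author's own reckoning.

```latex
\begin{align*}
\cos R &= \mathbf{P}\cdot\mathbf{B}
  && \text{by (12)},\\
-\sin R\; R' &= \mathbf{T}\cdot\mathbf{B} + \mathbf{P}\cdot\mathbf{B}'
  = -\frac{\mathbf{P}\cdot\mathbf{N}}{\tau}
  && \text{since } \mathbf{T}\cdot\mathbf{B} = 0,\ \mathbf{B}' = -\mathbf{N}/\tau,\\
\mathbf{P}\cdot\mathbf{N} &= \rho\,\mathbf{P}\cdot\mathbf{K} = -\rho = -\sin R
  && \text{by (10), (7) and (12)},\\
-\sin R\; R' &= \frac{\sin R}{\tau}
  \quad\Longrightarrow\quad R' = -\frac{1}{\tau}.
\end{align*}
```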

The geodesic curvature, κ, is simply related to R. The great circle tangent
to 𝒞 at P has, as unit normal, the vector P × T, and the absolute value of κ
is the magnitude of the derivative with respect to s of this vector, that is,
by (2), √((P × K)²). This reduces, with the help of (3) and (7), to √(K² − 1).
In terms of R, this is |cot R|. It will be convenient to give κ a sign, so we
shall identify it with cot R*,

\[ \kappa = \cot R. \tag{15} \]

Thus, if a right hand system of axes is postulated, κ > 0 when 𝒞 turns toward
the left of its tangent great circle.
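These relations are easily verified on the simplest spherical curve, a parallel circle. The sketch below is illustrative only; the helper names and the particular co-latitudes are assumptions. For the parallel of co-latitude θ, described eastward, it forms P, T and the curvature vector, and checks that P·K = −1, that the spherical radius of curvature defined by sin R = ρ, cos R = P·B equals θ, and that √(K² − 1) = |cot R|.

```python
import math


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def parallel_circle_frame(theta, s):
    """P, T and the curvature vector of the parallel of co-latitude theta,
    described eastward, at arc length s."""
    phi = s / math.sin(theta)  # longitude reached after arc length s
    P = (math.sin(theta) * math.cos(phi), math.sin(theta) * math.sin(phi),
         math.cos(theta))
    T = (-math.sin(phi), math.cos(phi), 0.0)
    K_vec = (-math.cos(phi) / math.sin(theta),
             -math.sin(phi) / math.sin(theta), 0.0)
    return P, T, K_vec


def spherical_radius(P, T, K_vec):
    """R with sin R = rho = 1/K and cos R = P.B, where B = T x N."""
    K = math.sqrt(dot(K_vec, K_vec))
    N = tuple(c / K for c in K_vec)
    B = cross(T, N)
    return math.atan2(1.0 / K, dot(P, B))


P, T, K_vec = parallel_circle_frame(0.7, 1.3)
R = spherical_radius(P, T, K_vec)
K = math.sqrt(dot(K_vec, K_vec))
print("P.K =", dot(P, K_vec))
print("R =", R)
print("sqrt(K^2 - 1) =", math.sqrt(K * K - 1.0), " |cot R| =", abs(1.0 / math.tan(R)))
```

For a co-latitude greater than π/2 the same functions return R > π/2, and cot R is negative, in agreement with the sign convention for κ.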
Combining (14) and (15), we have anew Professor Haskins’ result (O, p. 248):

\[ \frac{d}{ds}\tan^{-1}\kappa = \frac{1}{\tau}. \]


3. Spherical curves. Osculating and asymptotic circles. For the
purposes of applications to problems in dynamics, where coördinates are, in
general, analytic functions of the time, and of arc length of paths, it will
be adequate to consider curves whose curvatures are, in given intervals,
either always increasing, or always decreasing functions of s. Accordingly
we shall assume in this section, unless the contrary is stated, that in the
open interval considered, s_0 < s < s_1, K′ is either always positive, or else
always negative. The fact that K is never negative permits the inequality
K′ > 0 to embrace two symmetric types of curves between which it is


* See O, p. 248. The ρ on the page referred to is the spherical radius of curvature of
the intersection of the cone with the sphere. 




desirable to distinguish. We have, indeed, ρ′ < 0, but the first equation (12)
shows that cos R · R′ < 0, so that R′ ≶ 0 according as R ≶ π/2. A similar
situation obtains with respect to K′ < 0. Greater clarity will be attained when
it is possible to think of the spherical center of curvature as the nearer of the
two points which may play the rôle, so that in case R > π/2 for the arc
s_0 < s < s_1, of 𝒞, we shall replace 𝒞 by the curve symmetric to it with respect
to some diametral plane. The vectors P, T and K will thus go over into
symmetrically placed vectors, but B will go over into the symmetric vector
with sense reversed, so that the sign of cos R will be changed. It will be
noticed that the hypothesis that K′ is always positive, or else always negative,
on an arc precludes the possibility of an osculating great circle at an interior
point of the arc.
The third Frenet formula (13) becomes, with the help of (14),

\[ \mathbf{B}' = R'\,\mathbf{N}. \tag{16} \]


Accordingly, if σ represents the arc length of ℰ, increasing with increasing s,
the equality of magnitudes and directions in this vector equation yields the
result, for the case R′ < 0:

THEOREM III. The tangent to the spherical evolute ℰ at Q has, for direction,
the initial direction of the shorter great circle arc from Q to P, and the
differential of arc of ℰ is given by

\[ d\sigma = -dR. \tag{17} \]


Thus, if a flexible inextensible string be unwound from a metal guide having
the form of the curve ℰ, the string being kept taut and in contact with the
sphere, one of the points of the string will trace the curve 𝒞, so that in this
sense the name evolute for ℰ is appropriate.

The following property of the evolute will also prove useful: 

THEOREM IV. No arc of ℰ on which R′ is always positive, or always negative,
is an arc of a great circle.

Suppose, contrary to the theorem, that ℰ, for s′ < s < s″, is an arc of
a great circle. Then, if A denote a constant vector perpendicular to the plane
of this arc, we have A·B = 0. If this equation be differentiated, the result
reduces, by (16), to A·N = 0, since R′ ≠ 0. But A·N = 0 and A·B = 0
imply that T is parallel with A, a constant vector, which is impossible for
a spherical curve.

We may now infer some properties of a given spherical curve, 𝒞. The
first is




THEOREM V. If 𝒞 is an arc of ever increasing curvature, K′ > 0, s′ < s < s″,
any osculating circle C_2 (s = s_2) lies entirely within any osculating circle
C_1 (s = s_1) for which s′ < s_1 < s_2 < s″. Here “within” means in that one of
the two open regions, into which the circle divides the spherical surface, with
the less area.

The theorem is an immediate consequence of equation (17), which, integrated
from s_1 to s_2, yields σ_12 = R_1 − R_2, where σ_12 is the total length of ℰ
between the points characterized by s_1 and s_2. Here the vital significance of
the hypothesis of monotonic change in R (or K) appears, for σ is always
changing in the same sense. Without the hypothesis, ℰ would have cusps,
and σ_12, instead of representing the total length of the arc ℰ_12, would give
the algebraic sum of lengths between cusps.

Now R_1 and R_2 are less than π/2, and, therefore, so is σ_12. The latter, not
being the measure of a great circle arc, by Theorem IV, must be greater
than c_12, the length of the geodesic joining the ends of the arc ℰ_12. Hence
c_12 < R_1 − R_2. But R_1 and R_2 are the spherical radii of the osculating circles C_1
and C_2, and c_12 is the spherical distance between their centers, so that the
inequality just derived is the necessary and sufficient condition that C_2 lie
entirely within C_1.
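The condition c_12 < R_1 − R_2 used at the end of this proof can be tried on an example. The following sketch is illustrative and makes several assumptions: the function names, the placing of the first center at the north pole and the second in the x, z-plane, and the numerical radii are all chosen for convenience. It tests the inequality and compares it with a direct sampling of the points of the smaller circle.

```python
import math


def spherical_distance(a, b):
    """Great circle distance between the unit vectors a and b."""
    d = sum(x * y for x, y in zip(a, b))
    return math.acos(max(-1.0, min(1.0, d)))


def lies_within(q1, r1, q2, r2):
    """Condition c_12 < R_1 - R_2 for circle 2 to lie inside circle 1."""
    return spherical_distance(q1, q2) < r1 - r2


def circle_points(c, radius, n=360):
    """Points of the circle of spherical radius `radius` whose center lies
    in the x, z-plane at co-latitude c."""
    q = (math.sin(c), 0.0, math.cos(c))
    e1 = (math.cos(c), 0.0, -math.sin(c))  # unit vector normal to q, southward
    e2 = (0.0, 1.0, 0.0)                   # unit vector normal to q, eastward
    pts = []
    for i in range(n):
        t = 2.0 * math.pi * i / n
        pts.append(tuple(
            math.cos(radius) * q[j]
            + math.sin(radius) * (math.cos(t) * e1[j] + math.sin(t) * e2[j])
            for j in range(3)))
    return pts


north = (0.0, 0.0, 1.0)
q2 = (math.sin(0.1), 0.0, math.cos(0.1))
inside = lies_within(north, 0.5, q2, 0.3)
# Greatest distance from the first center to any sampled point of circle 2.
farthest = max(spherical_distance(north, p) for p in circle_points(0.1, 0.3))
print("condition satisfied:", inside)
print("farthest point of circle 2 from center 1:", farthest)
```

With the centers 0.1 apart and radii 0.5 and 0.3 the condition holds, and the farthest sampled point of the smaller circle is at distance 0.4 from the first center, which is less than 0.5.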

A more general, though less intuitive, statement may be made with a different
sense of “within” and the use of the geodesic curvature, so that osculating
great circles are admitted. The curve 𝒞 must cross each of its osculating
circles if the geodesic curvature has a derivative different from zero at the
point of osculation. If “within” means in that region into which 𝒞 enters
with increasing s, we may state the theorem: if for s′ < s < s″, κ′ is either
always positive or always negative, the later osculating circles always lie within
the earlier ones. It will be seen how this follows from the theorem as first
stated when it is observed that the arc consists at most of two pieces, on one
of which K′ > 0, and on the other of which K′ < 0.

An immediate corollary of Theorem V is 

THEOREM VI. Under the hypothesis of Theorem V, the arc s_1 < s < s_2 of 𝒞
lies entirely within C_1 and entirely without C_2. More generally, an arc of 𝒞
on which κ′ is always positive, or always negative, lies to one side of its osculating
circle at either extremity of the arc.

For if 𝒞 cut an osculating circle, or touched it, at a point other than the
point of osculation, there would exist two osculating circles with a common
point. This is contrary to Theorem V.

One more step yields the analogue for spherical curves of Theorem I:

THEOREM VII. If 𝒞 be supposed to be infinitely long, and if, from some
point on, K′ > 0, 𝒞 approaches an asymptotic circle or point, according as K
is bounded or not. If K′ < 0, 𝒞 approaches an asymptotic circle.




We may assume R < π/2, as indicated previously. Then as s increases
indefinitely, R, whose sine is 1/K, and which therefore changes monotonely,
must approach a limit, a, 0 ≤ a ≤ π/2. Accordingly, by (17), σ approaches
a limit. Hence the center of spherical curvature, Q, of 𝒞 approaches a limiting
position, and as R approaches a, it follows that 𝒞 approaches the asymptotic
circle which has for center the limiting position of Q, and for radius, a. This
circle may reduce to a point if K is increasing, or to a great circle if K is
decreasing.
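The convergence of the center of curvature can be watched numerically. The sketch below is an illustration under assumptions not in the paper: the geodesic curvature κ(s) = 1 + s/(1 + s) is an arbitrary increasing function with limit 2, the curve is generated from the equations P′ = T, T′ = −P + κ P × T (which express that the curve lies on the unit sphere with geodesic curvature κ), the integration is by the classical Runge-Kutta method, and all function names are hypothetical. The center is taken as Q = (κP + P × T)/√(1 + κ²), which is the tip of B when κ = cot R.

```python
import math


def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def kappa(s):
    """Assumed geodesic curvature: increases from 1 toward the limit 2."""
    return 1.0 + s / (1.0 + s)


def deriv(s, state):
    """Right-hand side of P' = T, T' = -P + kappa (P x T)."""
    P, T = state[:3], state[3:]
    PxT = cross(P, T)
    k = kappa(s)
    dT = tuple(-P[i] + k * PxT[i] for i in range(3))
    return T + dT  # tuple concatenation: (P', T')


def rk4_step(s, state, h):
    def shifted(k, f):
        return tuple(state[i] + f * k[i] for i in range(6))
    k1 = deriv(s, state)
    k2 = deriv(s + h / 2, shifted(k1, h / 2))
    k3 = deriv(s + h / 2, shifted(k2, h / 2))
    k4 = deriv(s + h, shifted(k3, h))
    return tuple(state[i] + h / 6 * (k1[i] + 2 * k2[i] + 2 * k3[i] + k4[i])
                 for i in range(6))


def centre_at(s_end, h=0.01):
    """Spherical center of curvature Q at arc length s_end."""
    state = (1.0, 0.0, 0.0, 0.0, 1.0, 0.0)  # P = i, T = j at s = 0
    for i in range(round(s_end / h)):
        state = rk4_step(i * h, state, h)
    P, T = state[:3], state[3:]
    PxT = cross(P, T)
    k = kappa(s_end)
    norm = math.sqrt(1.0 + k * k)
    return tuple((k * P[i] + PxT[i]) / norm for i in range(3))


q30, q60 = centre_at(30.0), centre_at(60.0)
drift = math.sqrt(sum((a - b) ** 2 for a, b in zip(q30, q60)))
# By (17) the evolute between the two points has length R(30) - R(60).
bound = math.atan(1.0 / kappa(30.0)) - math.atan(1.0 / kappa(60.0))
print("displacement of Q between s = 30 and s = 60:", drift)
print("R(30) - R(60):", bound)
```

The displacement of Q does not exceed the decrease of R, as (17) requires, and both shrink as the interval is moved to larger s.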

We close this section with the establishment of one more fact, which we 
shall need in what follows. 

THEOREM VIII. If the projections, x, y, z, of P have four continuous
derivatives with respect to s, ℰ bends, with increasing s, to the same or the
opposite side of its tangent great circle as 𝒞, according as R is less than or
greater than π/2.

To see this, we compare the triple products (B, B′, B″) and (P, P′, P″),
by expressing them in terms of the orthogonal set, T, N, B. We start by
differentiating the third Frenet formula, B′ = −N/τ, and simplifying by
means of the second:

\[ \mathbf{B}'' = \frac{\tau'}{\tau^2}\,\mathbf{N} - \frac{\mathbf{N}'}{\tau}
 = \frac{\tau'}{\tau^2}\,\mathbf{N} - \frac{\mathbf{B}}{\tau^2} + \frac{\mathbf{T}}{\rho\tau}, \]

so that

\[ (\mathbf{B}, \mathbf{B}', \mathbf{B}'') = \left(\mathbf{B},\, -\frac{\mathbf{N}}{\tau},\, \frac{\mathbf{T}}{\rho\tau}\right) = \frac{1}{\rho\tau^2}. \]

On the other hand, (P, P′, P″) = (P, T, K) = (P, T, N)/ρ, by (1), (2),
and (10). But P must be expressed in terms of T, N, and B: P = (P·T)T
+ (P·N)N + (P·B)B, or, by (5), (10), (7) and (12),

\[ \mathbf{P} = -\rho\,\mathbf{N} + \cos R\,\mathbf{B}. \tag{18} \]

Hence (P, P′, P″) = cos R/ρ, and we have, finally, τ² cos R (B, B′, B″)
= (P, P′, P″), an equation of which the theorem is a qualitative translation
into words. It will be noticed that the hypothesis of differentiability precludes
the vanishing of the denominators ρ and τ in the above reasoning.

4. Curves with monotonic co-latitude. In a number of dynamical
problems connected with the sphere, the motion is limited by two parallel
circles. This section will be devoted to a curve, 𝒞, on the sphere, bounded by
two such circles, the distance of 𝒞 from the plane of one of the circles being




ever increasing, or ever decreasing, and its spherical radius of curvature
having the same property. We shall suppose that this curve passes from
tangency to C_0 at P_0 for s = s_0 to tangency to C_1 at P_1 for s = s_1 (s_1 > s_0).
The subscripts 0 and 1 will be used generally to distinguish quantities or points
connected with the beginning and end, respectively, of this arc of 𝒞. The
direction of the common axis of C_0 and C_1, with the sense from the plane of
C_0 to that of C_1, we shall call north, and take it for that of the z-axis. The
x-axis of our orthogonal axis-system we take in the prime meridian, through P_0.
The y-axis we take so as to form with the others a right hand system, or
eastward as seen by an observer at P_0. For the sake of definiteness we shall
suppose that 𝒞 runs initially eastward, a restriction to be removed later.


Fig. 1. 


THEOREM IX. Let 𝒞 be a curve on the unit sphere, S, whose position vector,
P, has four continuous derivatives with respect to s on the closed interval
s_0 ≤ s ≤ s_1, and let C_0 and C_1 denote two parallel circles on S, neither of them
point circles. Let 𝒞 pass from tangency to C_0 to tangency to C_1, running
initially eastward, under the following conditions on the open interval s_0 < s < s_1:
(a) the point P, of 𝒞, corresponding to the arc-length s has ever increasing
distance from the plane of C_0: z′ > 0,

(b) the spherical radius of curvature, R, of 𝒞, is ever decreasing: R′ < 0.

Then the longitude of P_1, measured positively to the east, exceeds the longitude
of P_0.

It will be noticed that the hypothesis does not eliminate the possibility of
an osculating great circle, for R may exceed π/2. It is for this reason that
the spherical radius of curvature is more appropriate for the present purpose




than the curvature, K. The particular evolute here used is that defined 
previously (p. 506). 

An intuitive notion of the proof of the theorem may be gained from Figure 1.
We compare the longitude, ψ, of P, on 𝒞, with the longitude, φ, of Q, on ℰ.
Initially ψ = φ = 0, but ψ increases initially more rapidly than φ, and
remains greater, while φ is always increasing. Hence ψ is positive at P_1.
What follows is merely an examination of the details.

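We begin with an analytic formulation of the hypotheses:

\[ \begin{aligned}
&\text{for } s = s_0 \text{ and } s = s_1, && \mathbf{T}\cdot\mathbf{k} = 0,\\
&\text{for } s = s_0, && \mathbf{T} = \mathbf{j},\\
&\text{for } s_0 < s < s_1, && R' < 0,\\
& && (\mathbf{P}\cdot\mathbf{k})' = \mathbf{T}\cdot\mathbf{k} > 0.
\end{aligned} \tag{19} \]

The identity connecting the direction cosines of k with respect to the orthogonal
set T, N, B will also prove useful:

\[ (\mathbf{T}\cdot\mathbf{k})^2 + (\mathbf{N}\cdot\mathbf{k})^2 + (\mathbf{B}\cdot\mathbf{k})^2 = 1. \tag{20} \]

Finally, it will be convenient to employ two vectors in the equatorial plane
with the same longitudes as P and Q,

\[ \mathbf{p} = \frac{\mathbf{P} - (\mathbf{P}\cdot\mathbf{k})\,\mathbf{k}}{\sqrt{1-(\mathbf{P}\cdot\mathbf{k})^2}}, \qquad
   \mathbf{b} = \frac{\mathbf{B} - (\mathbf{B}\cdot\mathbf{k})\,\mathbf{k}}{\sqrt{1-(\mathbf{B}\cdot\mathbf{k})^2}}, \tag{21} \]

in which the denominators do not vanish for s_0 < s < s_1 because of the
relations (19) and (20).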

Our first task is to see that the initial longitudes of P and Q may be taken
as 0. The first, ψ_0 = 0, is a matter of definition, and is implied in the term
“prime meridian”. As Q is always in the plane through P normal to 𝒞, Q_0 is
in the same meridian plane as P_0, and, by the definition of ℰ, lies to the
north of P_0. But the important point is that Q_0 is not separated from P_0 by
the pole (the north pole is meant here and in what follows) on the shorter
meridian arc connecting these points. This is because the osculating circle,
C′, of 𝒞 at P_0 cannot go south of C_0 without carrying 𝒞 with it, which is




contrary to hypothesis (19). Hence Q_0 is either at the pole, or on the prime
meridian. In the latter case, its longitude is an integral multiple of 2π, which
may be taken as 0. If Q_0 is at the pole, i. e. if C′ coincides with C_0, we shall
define φ_0 as the limit of φ as s → s_0. It will presently appear that this limit
exists, and is 0.

We next show that φ is always increasing. To do this, we recall that the
magnitude of the derivative of a unit vector is the rate at which it is changing
direction, i. e. that |φ′| = √((b′)²). A little reckoning is required to compute
this quantity, but it is straight-forward, and need not be set down. One starts
with the second of the formulas (21) and uses the relations (1), (16) and (20), obtaining

\[ \varphi' = \frac{-R'\,(\mathbf{T}\cdot\mathbf{k})}{1-(\mathbf{B}\cdot\mathbf{k})^2}, \tag{22} \]

in which the sign is determined as follows. Because of the last two relations (19), φ′ never
vanishes for s_0 < s < s_1, and so, being continuous, keeps its sign. By
Theorem VIII, ℰ bends in the same sense as 𝒞, or in the sense opposite to
that of 𝒞, according as R ≶ π/2; hence ℰ bends to the left. Moreover, by (16),
ℰ runs initially southward, being tangent to the prime meridian, so that it
bends to the east, and φ is initially increasing. The equation (22) then shows
that φ′ is always positive, and φ always increasing, as stated. The same
considerations show that φ approaches 0 as s → s_0, even if Q_0 is the pole.

It remains to show that ψ − φ > 0. To do this, we note that by the
definition of vector product, b × p = k sin(ψ − φ). Hence sin(ψ − φ)
= (b, p, k). Another brief reckoning, involving (20), (18), and the fact that
B × N = −T, gives the result

\[ \sin(\psi-\varphi) = \frac{\rho\,(\mathbf{T}\cdot\mathbf{k})}{\sqrt{1-(\mathbf{P}\cdot\mathbf{k})^2}\,\sqrt{1-(\mathbf{B}\cdot\mathbf{k})^2}}. \tag{23} \]

This shows that the angle ψ − φ, which is continuous, and which starts at 0,
has a positive sine until P_1 is reached, so that ψ_1 ≥ φ_1 > 0, as was to be
proved.
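The relation sin(ψ − φ) = (b, p, k) on which this step rests is readily checked for particular vectors. The sketch below is illustrative; the helper names and the sample angles are assumptions. It projects two unit vectors on the equatorial plane, forms the triple product (b, p, k) = b·(p × k), and compares it with the sine of the difference of the longitudes.

```python
import math


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def unit_vector(colatitude, longitude):
    """Unit vector with the given co-latitude and longitude."""
    return (math.sin(colatitude) * math.cos(longitude),
            math.sin(colatitude) * math.sin(longitude),
            math.cos(colatitude))


def equatorial_unit(v):
    """Unit vector in the equatorial plane with the longitude of v.
    The vector v must not point at a pole."""
    d = math.sqrt(1.0 - v[2] ** 2)
    return (v[0] / d, v[1] / d, 0.0)


def sin_longitude_difference(P, B):
    """(b, p, k) = b.(p x k), which equals sin(psi - phi)."""
    k = (0.0, 0.0, 1.0)
    p, b = equatorial_unit(P), equatorial_unit(B)
    return dot(b, cross(p, k))


P = unit_vector(0.8, 1.0)  # longitude psi = 1.0
B = unit_vector(0.3, 0.4)  # longitude phi = 0.4
value = sin_longitude_difference(P, B)
print("(b, p, k) =", value, " sin(psi - phi) =", math.sin(0.6))
```

Interchanging the two vectors changes the sign of the result, as the triple product requires.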

More, however, may be inferred from the above developments. The
formula (23) shows that as s → s_1, ψ − φ → 0 or π. Now, since R < π, the
latter case can occur only when the terminal spherical radius of curvature,
which must lie along a meridian, lies across, or terminates in, the pole. This
means that the final osculating circle, C″, contains C_1 (within that region in
which Q_1 lies), or coincides with C_1. Also, ψ − φ increases toward π, and φ




is always increasing, so that ψ is increasing at P_1. The result is that for this
case 𝒞 touches C_1 still running eastward. If ψ − φ → 0, as s → s_1, the terminal
radius of curvature cannot lie across the pole, and C″, with spherical radius
less than the polar arc to P_1, must lie south of C_1, and 𝒞 then touches C_1
running west. We accordingly infer

THEOREM X. If to the hypotheses of Theorem IX, we add (c) 𝒞 touches both
circles C_0 and C_1 running eastward, then the difference in longitude of its points
of contact with these circles exceeds π.

It remains to consider the case R′ > 0 (see Figure 2). As before, we may take
ψ_0 = 0, and Q_0 is certainly between P_0 and the pole. For the former argument


shows that Q_0 is not beyond the pole; and no more can Q_0 coincide with the
pole, for from (15) we infer that −cosec² R · R′ = κ′, so that κ′ would be
negative, i.e. κ decreasing, and 𝒞 would have to bend to the south of C_0,
contrary to hypothesis. As to ℰ, (16) now shows that it runs northward, and
hence to the west, by Theorem VIII. As R′ > 0, formula (22) is valid without
change, and φ, initially 0, continually decreases. But by (23), sin(ψ − φ)
is positive, and it approaches 0 as s → s_1.

If ψ − φ → 0, the terminal spherical radius of curvature cannot cross the
pole, so that the spherical radius of C″ is less than that of C_1. Hence, as C″
cannot lie north of C_1, its center also lies south of C_1. As φ is decreasing, and
ψ − φ approaching 0 through positive values, we infer that ψ is decreasing,
and that in this case 𝒞 touches C_1 running west. We have here, ψ_1 ≤ φ_1 < 0,
and the result is merely Theorem IX with the rôles of C_0 and C_1 interchanged,
and 𝒞 replaced by its reflection in the prime meridian.




But if ψ − φ → π, the terminal radius of curvature lies across the pole, and
we infer that 𝒞 is still running eastward at P_1, so that we may state

THEOREM XI. If in Theorem IX we replace the condition (b) by
(b′) the spherical radius of curvature of 𝒞 is always increasing, R′ > 0,
and add the condition (c) of Theorem X, it then follows that the difference in
longitude of the points of contact of 𝒞 with the circles C_0 and C_1 is less than π.

Finally, we consider the removal of the restriction that 𝒞 run initially
eastward. If 𝒞 runs initially westward, we may consider its reflection in the
plane of the prime meridian. The proofs of the corresponding theorems remain 
the same, except that the other evolute from the one we have been employing 
must be used. Or, we may use a left hand system of axes, which will produce 
the same effect. The three last theorems may now be stated as follows. 

Let 𝒞 run from tangency at P_0 to C_0, to tangency at P_1 to C_1, the distance
of the moving point, P, on 𝒞, from the plane of C_0 having a positive derivative
with respect to the arc length s for s_0 < s < s_1. Let R be the spherical radius of
curvature of 𝒞 which is measured initially toward P_0 from the side of C_0 on
which C_1 lies. Then

(1) if R′ < 0, or if R′ > 0 for s_0 < s < s_1, that one of the points P_0, P_1, for
which R is the less, has the greater longitude, measured in the sense of increasing
arc length at P_0;

(2) if R′ < 0 for s_0 < s < s_1 and the longitude of P is changing in the same
sense at P_0 and at P_1, the difference in longitude of these two points exceeds π;

(3) if R′ > 0 for s_0 < s < s_1, and the longitude of P is changing in the same
sense at P_0 and at P_1, the difference in longitude of these two points is numerically
less than π.

In the above statements, the geodesic curvature, κ, might have been used
instead of R, but it would have necessitated a subdivision of cases according
as κ ≷ 0, or else a modification of the convention as to the sign of κ. Neither
seems desirable.


PART II. SOME POINTS IN THE THEORY OF THE TOP


5. Osgood's intrinsic equations. Notation. Some familiarity on the 
part of the reader with Professor Osgood’s paper will be assumed in what 
follows. His intrinsic equations, however, will appear here with a change in 
a sign, and a slight change in notation.* We write them as follows: 


*The change in sign is due to a different convention as to the senses in which certain 
quantities are measured. In justification of a departure from Professor Osgood’s well
considered conventions, I can only plead that I have felt surer footed in employing conventions
to which I have been accustomed. Since there is little agreement in the literature on the 
subject, the departure will cause the reader little inconvenience. 




\[ A v \frac{dv}{ds} = T, \qquad A v^2 \kappa - C r v = N, \qquad C v \frac{dr}{ds} = S. \tag{24} \]


Here C and A are the moments of inertia of the top about its axis of dynamic
symmetry, and a perpendicular to the axis through the fixed point, respectively.
The magnitude of the velocity of the point P, where the axis of the top (on
which a positive sense has been arbitrarily fixed) pierces the unit sphere, is
denoted by v; the geodesic curvature of 𝒞, the path of P, or the “bending”
of the cone swept out by the axis of the top, is denoted by κ; κ is positive
when 𝒞 swerves to the left of an observer walking on the sphere and following
the top axis. The sense of the positive unit tangent vector, t, to 𝒞, is that of
the motion, and that of the normal vector, n (here tangent to the sphere, and
not to be confused with the principal normal vector, N, of Part I), is to the
left of the above mentioned observer. Thus a curve with positive κ bends
toward the positive normal. The component of the applied moment, along
the axis, or the spin moment, is denoted by S. The remaining component of
the applied moment may be regarded as due to a force tangent to the sphere
and applied at P. We denote the components of this force in the directions
of t and n by T and N, respectively. In Professor Osgood’s notation, this
force has components T and Q along the tangent and negative normal vectors
associated with 𝒞, but his normal vector points to the right, so that his Q and
the present N are identical. Here, a positive moment or rotation about an
axis would force a right hand screw forward in the positive sense along the
axis. Thus our S and r are opposite in sign to the corresponding N and r of
Professor Osgood’s paper. The only change in sign resulting from these
differences is in that of the term Crv in the second intrinsic equation.

6. The energy integral. This is obtained, in case it exists, from equations
(24₁) and (24₃), and may be written

$$\tfrac{1}{2}\left(A v^{2} + C\nu^{2}\right) = \int \left(T + \frac{S\nu}{v}\right) ds + h, \tag{25}$$

provided the integrand is the derivative with respect to s of a function of
position, as it is, for instance, in the case of the heavy frictionless top with
fixed peg (O, p. 256, (i)).




7. The heavy top with fixed peg. Initial sense of change of
latitude. The equations of the motion may be written (O, p. 258, (6) and (iv);
p. 256, (i))

$$\left(\frac{du}{dt}\right)^{2} = P(u), \qquad \frac{d\psi}{ds} = \frac{\varepsilon + \gamma(u_0 - u)}{v\,(1-u^{2})}, \qquad \kappa = \frac{2\gamma v_0^{2} - \alpha\varepsilon + \alpha\gamma(u_0 - u)}{2v^{3}}. \tag{26}$$

Here u = cos θ, θ being the co-latitude of P; ψ is the longitude of P,
positive when measured eastward; u₀, ψ̇₀, and v₀ are initial values,
corresponding to some moment when θ̇ (= dθ/dt) = 0; ε = (1 − u₀²)ψ̇₀;
v₀² = sin²θ₀ ψ̇₀² = (1 − u₀²)ψ̇₀²; γ = Cν/A, ν being constant;*
α = 2Mgh/A, where M is the mass, and h the distance of the center of mass
from the fixed point. By the energy integral, v² = v₀² + α(u₀ − u). The
function P(u) is given by

$$P(u) = (1-u^{2})\left[v_0^{2} + \alpha(u_0 - u)\right] - \left[\varepsilon + \gamma(u_0 - u)\right]^{2}. \tag{27}$$

The motion takes place between two parallel circles, C₁ and C₂, which may,
in special cases, reduce, one or both, to point circles. We shall disregard
such special cases in the present study.

The first question that arises when the initial conditions include θ̇₀ = 0,
as at present, is, does the top begin to rise, or to fall, or, is the initial limiting
circle the lower, C₁, or the upper, C₂? To answer this, one compares the
initial value of κ, or

$$\kappa_0 = \frac{2\gamma v_0^{2} - \alpha\varepsilon}{2v_0^{3}}, \tag{28}$$

* We shall assume ν, and hence γ, to be positive, or when so specified, zero; never
negative. It will be recognized that this restriction merely involves, in some cases,
reflecting the motion in a mirror.




with the geodesic curvature, κ_c, of the initial limiting circle. But

$$\kappa_c = \operatorname{sgn}\dot\psi_0\,\cot\theta_0 = \frac{u_0\,\dot\psi_0}{v_0},$$

since v₀ = √(1 − u₀²) ψ̇₀ sgn ψ̇₀. Hence

$$(\kappa_0 - \kappa_c)\,\dot\psi_0 = -\frac{1}{v_0}\left[u_0\dot\psi_0^{2} - \gamma\dot\psi_0 + \frac{\alpha}{2}\right]. \tag{29}$$

If κ₀ > κ_c, 𝒞 bends more rapidly to the left than the initial limiting circle,
i. e. if running east, 𝒞 rises, and similarly for the other cases, the sign of
(κ₀ − κ_c)ψ̇₀ being decisive. The function in brackets is the quadratic whose
roots give the longitudinal velocities of steady precession in the circle of
latitude θ = θ₀. If u₀ is positive, (κ₀ − κ_c)ψ̇₀ > 0 for ψ̇₀ between the roots
of the quadratic, and if u₀ < 0, the opposite is the case. If u₀ = 0, there is but
one root, and (κ₀ − κ_c)ψ̇₀ > 0 for ψ̇₀ greater than this root. Hence we may
state

THEOREM XII. If, at any instant, the path of P touches a circle of latitude,
it will curve away from, or toward, the equator according as its longitudinal
velocity at the instant does, or does not, lie between the longitudinal velocities
for steady precession in that circle. If the circle is the equator, the path curves
northward or southward according as the longitudinal velocity does, or does not,
exceed the single longitudinal velocity for steady precession on the equator. If
the spin is so slight that the precessional values are coincident, or imaginary,
the top always falls.

It is, of course, understood that the initial longitudinal velocity is not 
a precessional value in the statement of the above theorem. 

8. Reversals in sense of the longitudinal motion. Equation (26₂)
gives the value of u for which the longitudinal motion changes sense:
u = u₀ + ε/γ. In order, however, that this value of u shall correspond to a real
point in the motion, it is necessary first, that it lie in the closed interval (−1, +1),
and secondly, that P(u) ≥ 0. As P(u) < 0 if u < −1, these conditions may
be stated as follows:


(30) 


Pian) 7) Wo | Wo i 


1— 


The function sgn z is the familiar function signum z, defined as follows: for z < 0,
sgn z = −1; for z = 0, sgn z = 0; and for z > 0, sgn z = +1.


. 7 
1+ 




where σ = α/2γ is the value of the longitudinal velocity for steady precession
on the equator.

9. Points of inflection. The curve 𝒞 has a point of inflection at P when
its projection on the plane tangent to the sphere at P has a point of inflection
at P, or, when 𝒞 crosses an osculating great circle. A necessary condition for
this is κ = 0; it is sufficient that κ change signs. Equation (26₃) gives, as
the only value of u for which this can occur, uᵢ = u₀ + (2v₀²/α) − (ε/γ). Here,
the conditions that uᵢ correspond to a real point in the motion may be given
the form


Fi (te) the — SO. 
(31) P(w) = = 0) — 20) 
( 


9 9 
— u? | 


L 0 J 


The factorization in the value of P(uᵢ) has been carried as far as possible in
the domain of rationality (α, γ, u₀, √(1 − u₀²)). This may be verified by
considering the special rational values α = 16/5, γ = 4/5, σ = 2, u₀ = 3/5,
which reduce the cubic factor to the form z³ − 2z² + 2z + 2, a polynomial
evidently without rational roots.
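
The absence of rational roots can be checked mechanically by the rational root test. The following is only a sketch: it takes the cubic to be z³ − 2z² + 2z + 2, which is an assumed reading of the printed coefficients, and the helper names `divisors` and `rational_roots` are ours.

```python
from fractions import Fraction


def divisors(n):
    """Positive divisors of a nonzero integer."""
    n = abs(n)
    return [d for d in range(1, n + 1) if n % d == 0]


def rational_roots(coeffs):
    """Rational roots of an integer polynomial.

    coeffs runs from the highest power down to the constant term,
    and the constant term is assumed to be nonzero.
    """
    def value(x):
        acc = Fraction(0)
        for c in coeffs:          # Horner's rule in exact arithmetic
            acc = acc * x + c
        return acc

    # Rational root test: any root p/q has p | constant and q | leading term.
    candidates = {Fraction(sign * p, q)
                  for p in divisors(coeffs[-1])
                  for q in divisors(coeffs[0])
                  for sign in (1, -1)}
    return sorted(x for x in candidates if value(x) == 0)


cubic = [1, -2, 2, 2]             # assumed reading: z^3 - 2z^2 + 2z + 2
print(rational_roots(cubic))      # an empty list means no rational root
```

Only the four candidates ±1, ±2 need be tried for this cubic, so the check is also easily made by hand.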

10. Monotonic curvature. For the application of some theorems of
Part I, we need assurance that κ′ keeps its sign between the limiting circles.
This derivative, obtained from (26₃), may be reduced, without excessive
trouble, to


er sis Bae 


Striking is the fact that like the curvature, κ, and the longitudinal velocity,
ψ̇, this derivative is dependent for its sign on a linear function of u. The
derivative, u′, of u, keeps its sign between the limiting circles. The value
of u for which κ′ vanishes is u₀ − (3ε/γ) + (4v₀²/α), and the conditions
that it characterize a real point in the motion may be given the form




Py ( Uy ) —- 


8 3 
a 2 


| 5 3) 1287 (1— 22) Fie— ( 


The cubic factor is, in this case also, irreducible in the domain (α, γ, u₀,
√(1 − u₀²)), as may be seen by using the special values α = 24, σ = 2, u₀ = 0.

11. Applications of the geometry of spherical curves. It is not
the purpose of the present paper to enter into a detailed discussion of the
various cases of motion of the top which may present themselves, although
the materials gathered above permit new distinctions between types.* We
shall rather content ourselves with the enunciation of certain typical results,
following the customary cases (O, p. 258).†

Case I. The longitude is always increasing (O, Figure 3, I). In this case, it
is usually assumed that the path of P has points of inflection. The illustrations
of the most current text books show such a curve.‡ It should be noticed,
however, that such is not necessarily the case, and that in the present type
of motion both paths with inflections and paths without inflections can occur.
Thus, for α = 2, σ = γ = 1, u₀ = 1/2, with ψ̇₀ near to 1, the condition (30₁)
for a change in sense of the longitudinal motion is not fulfilled, so that the
longitude is always increasing. The condition (31₁) for a real point of
inflection is fulfilled, while the condition (31₂) takes the form ψ̇₀ − 1 ≤ 0 for
ψ̇₀ near 1. Thus, with ψ̇₀ slightly less than 1, we have a path with a point
of inflection on each arc between the limiting circles, and with ψ̇₀ slightly
greater than 1, we have an inflectionless path.
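
The numerical example may be checked directly. The sketch below rests on assumptions about the notation: ε = (1 − u₀²)ψ̇₀, v₀² = (1 − u₀²)ψ̇₀², P(u) = (1 − u²)[v₀² + α(u₀ − u)] − [ε + γ(u₀ − u)]², the reversal value u₀ + ε/γ and the inflection value u₀ + 2v₀²/α − ε/γ. The function name `inflection_data` is ours.

```python
def inflection_data(alpha, gamma, u0, psi_dot0):
    """Return (reversal value of u, inflection value of u, P at the inflection value).

    Assumed reading of the formulas of Sections 7 to 9; psi_dot0 is the
    initial longitudinal velocity.
    """
    eps = (1 - u0**2) * psi_dot0
    v0_sq = (1 - u0**2) * psi_dot0**2

    def P(u):
        return (1 - u * u) * (v0_sq + alpha * (u0 - u)) - (eps + gamma * (u0 - u))**2

    u_rev = u0 + eps / gamma                       # longitude would reverse here
    u_inf = u0 + 2 * v0_sq / alpha - eps / gamma   # curvature would vanish here
    return u_rev, u_inf, P(u_inf)


for psi_dot0 in (0.95, 1.05):
    u_rev, u_inf, p = inflection_data(2.0, 1.0, 0.5, psi_dot0)
    print(psi_dot0, u_rev > 1, -1 < u_inf < 1, p)
```

On this reading the reversal value exceeds 1 in both trials, so the longitude is always increasing, while P at the inflection value changes from positive to negative as ψ̇₀ passes through 1.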

If the spin be stopped (γ = 0), the top becomes a spherical pendulum.
The longitude always changes monotonely, (26₂), and κ reduces, by (26₃), to
−αε/2v³. Thus the path of the spherical pendulum never has inflection points.
The use of the intrinsic equations has rendered extremely simple the proof
of a well known fact.


* Mr. A. H. Copeland, of the Graduate School at Harvard, is undertaking such a
discussion in connection with his candidacy for the doctorate.
† Case I is the wavy curve without cusps or double points, Case II is the curve with
cusps, and Case III, the curve with loops.
; Professor Osgood has also overlooked the necessity of imposing the conditions (31) 
on % in order to secure a path with inflection points (see 0. p. 259). 


“T lg) 




Another fact about the spinning top that seems hitherto to have escaped
explicit mention is the following: there exist cases in which the longitude of P
increases by more than π between two successive contacts with the limiting
circles.*

Sufficient conditions for this type of motion are (see Theorem X): the
longitude is always increasing, the top is rising from u = u₀, and R′ < 0
(the definitions of R in Section 2 and Section 4 coincide in the present case, and
we find from (15) that the last condition is equivalent to κ′ > 0). An example
showing the compatibility of these conditions is the following: α = γ = 1,
u₀ = 0.4, ψ̇₀ = 1.44312. We find u₁ = 0.5, ε = 1.21222, v₀² = 1.74938.
The condition (30₁) for a change in sense of the longitudinal motion is
contradicted, while ψ̇ is initially positive; κ′ is seen from (32) to be initially
positive, while the value of u (p. 519) for which κ′ changes sign is found to be
greater than 1, and so does not correspond to a real point on the path.†
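
The example lends itself to a rough numerical check. The sketch below assumes the readings α = γ = 1, u₀ = 0.4, ψ̇₀ = 1.44312, the cubic P(u) = (1 − u²)[v₀² + α(u₀ − u)] − [ε + γ(u₀ − u)]², and dψ/du = [ε + γ(u₀ − u)]/[(1 − u²)√P(u)] on a rising arc; the function name `longitude_gain` and the quadrature are ours, and no attempt is made to reproduce a four-figure value.

```python
import math


def longitude_gain(alpha, gamma, u0, psi_dot0, steps=2000):
    """Return (u1, increase of longitude between the circles u = u0 and u = u1).

    Assumes a rising arc (u0 < u1) and the reading of P(u) stated above.
    """
    eps = (1 - u0**2) * psi_dot0
    v0_sq = (1 - u0**2) * psi_dot0**2
    K = v0_sq + alpha * u0
    E = eps + gamma * u0
    # P(u) = alpha*u^3 + p2*u^2 + p1*u + p0, with u0 as a root.
    p2 = -(K + gamma**2)
    p1 = 2 * E * gamma - alpha
    # Divide out (u - u0): remaining factor is alpha*u^2 + q1*u + q0.
    q1 = p2 + alpha * u0
    q0 = p1 + q1 * u0
    disc = math.sqrt(q1 * q1 - 4 * alpha * q0)
    u1 = (-q1 - disc) / (2 * alpha)     # second limiting circle
    u3 = (-q1 + disc) / (2 * alpha)     # third root, outside the motion
    # Substituting u = mid + half*sin(phi) removes the endpoint singularities.
    mid, half = (u0 + u1) / 2, (u1 - u0) / 2
    total = 0.0
    for k in range(steps):              # midpoint rule in phi
        phi = -math.pi / 2 + (k + 0.5) * math.pi / steps
        u = mid + half * math.sin(phi)
        total += (E - gamma * u) / ((1 - u * u) * math.sqrt(alpha * (u3 - u)))
    return u1, total * math.pi / steps


u1, gain = longitude_gain(1.0, 1.0, 0.4, 1.44312)
print(u1, gain, gain > math.pi)
```

Under these assumptions the second limiting circle comes out at u = 0.5 to the accuracy of the data, and the increase of longitude on one arc exceeds π.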

Case II. Here ψ̇₀ = 0, and by (32), κ′ can vanish only for u = u₀. We
infer, from Theorem VI, that an arc of the path of P between two successive
contacts with the limiting circles lies entirely within the osculating circle at the
extremity of the arc at which the curvature is finite.

Case III. The same situation obtains here, where the path has loops. Let
us use 𝒞 to denote an arc of the path between the limiting circles. Then 𝒞
lies entirely within its osculating circle at one extremity, and entirely without
its osculating circle at the other extremity, one of these circles containing the
other in its interior, by Theorems V and VI. ‘Interior’ may here be interpreted
in the narrower sense, namely, the region with the less area.

To justify these statements with regard to Case III, we must show that for
the looped curve, neither κ′ nor κ vanishes between the limiting circles, i. e.
that the conditions (33) and (31) are both incompatible with the conditions (30),
when the strict inequalities are employed in the first two. We shall give the
proof for the conditions (33), that for (31) being similar, and simpler.

In the first place, it is no restriction to assume that ψ̇₀ > 0, for this merely
means that a proper choice has been made of that limiting circle which is
to be the initial one, inasmuch as 𝒞 meets the limiting circles running in


* If P(u) has u = 1 as a double root, 𝒞 makes a spiral around the north pole. It
seems entirely plausible that near this motion are others of the type in question, where
the increase in longitude is arbitrarily large. It may also be of interest to note that
a similar situation exists in a very elementary problem, namely, the following: a bead is
free to slide under gravity without friction on a circular wire, which rotates with constant
angular velocity about a vertical diameter. It will be found that the wire makes more
than a half revolution between two successive times when the bead attains its extreme
heights.
† A computation, which I believe to be accurate, gives, to four significant figures,
3.428 for this increase in longitude, which is about 9.1 percent in excess of π.




opposite senses. This assumption greatly simplifies (30,), reducing it to 
> 0, so that the conditions for the loopy type of curve become 


‘ j 
(34) 2 


We proceed to show that for Ww, thus limited, the conditions derived from (33), 


0 


a(a—60" 


126°(1—w?) 


| 
0 


are incompatible. If the first of these inequalities be multiplied by ”% and 

subtracted from the second, we find a necessary condition on Y, for their con- 

sistency which reduces to w%[302(1-+%)—a]+a0/2>0. From (34), 

recalling that y == a/26, we have a>407(1+ wu), so that the coefficient 

of Ws, is negative, and 

«0/2 

a) 

If from this we form the inequality for w,/20, and in it substitute 40° (1 -- 
a(1—z.), so that x is always positive, we find as upper bound for w,/26 

a function of 2 whose maximum is 1, whereas, by (34), W,/26 must be greater 

than 1. Thus the incompatibility of the conditions (33) is established. 

We may state further concerning this case, the consequence of Theorem IX:
the general drift of the longitudinal motion is in the sense which it has at that
one of the limiting circles where the curvature is numerically the less. This is
a property whose proof has previously required less elementary methods (see
the reference, O, p. 260).

If the spin of the top be stopped again, so that the spherical pendulum is
before us, it is known that the difference in longitude between two contacts
of the path with its limiting circles exceeds π/2 and is less than π. The first
inequality is obtained by the appraisal of a definite integral. The second has
required the use of Cauchy’s integral theorem, or other less elementary method
(see Appell, loc. cit.). It is readily verified that this second inequality is an
immediate consequence of Theorem XI. The hypothesis of the theorem which


» “ 
<0. 
4(1+ &) 
(35) 




interests us may be given the form Wz’ <0, for with the definition of # 
there employed, the relation (15) is to be replaced by * = sgn w, cot R, 
when we consider a rising arc, ©, of the path. For y = 0, (32) takes the 
form x = —3a?u'(1—uw?) W,/4v°, so that the hypothesis is fulfilled. 

12. Asymptotic circles. On page 225 of his article, Professor Osgood
mentions an interesting case of motion of the symmetric top, in which it
is subjected to a force of constant magnitude, directed always along the
positive tangent to the path, 𝒞. He says that κ now approaches 0, and “it is
a matter of conjecture as to whether 𝒞 has an asymptotic great circle”.
Inasmuch as it appears rather obvious that it should have, the statement
might seem to be excessively cautious. But the example given in Section 1
shows such caution entirely justified. As a matter of fact, 𝒞 has an asymptotic
great circle, by Theorem VII. For, if f denote the magnitude of the force,


we find s = re (t--t,)® + r9(*#—t)), so that the path is infinitely long: also 


that = Cr/Av, where = v3 + so that and <0, and 


hence K’'<0. The hypotheses of Theorem VII are therefore fulfilled. 

Certain general criteria may be set up for motion with an asymptotic circle.
We shall suppose that from some point (t = t₀, s = s₀) on, v does not vanish,
and that the functions involved have whatever continuous derivatives are
required. Then the path will be of infinite length if the integral

$$\int_{s_0}^{s} \frac{ds}{\sqrt{v_0^{2} + \dfrac{2}{A}\displaystyle\int_{s_0}^{s} T\,ds}}$$

is real and finite for all s > s₀. This is a first condition.

If we differentiate with respect to s the equation (24₂), and simplify the
result by means of (24₁) and (24₃), we obtain

$$A v^{2}\kappa' = N' + S - \frac{T\,(C\nu v + 2N)}{A v^{2}}.$$

That the right hand member of this equation, which we assume to be
continuous, should never vanish, is the second of the desired conditions.




In the case of a purely tangential force, the latter condition takes the
following form: T shall not vanish for s > s₀. If T is negative, the motion
comes to a halt unless $\int_{s_0}^{s} (-T)\,ds < A v_0^{2}/2$ for all s > s₀. Otherwise, the
path has an asymptotic circle, which may, however, reduce to a point provided
T < 0 and v → 0, i. e. $\int_{s_0}^{\infty} (-T)\,ds = A v_0^{2}/2$. The asymptotic circle is a great
circle if T > 0 and $\int_{s_0}^{\infty} T\,ds$ is divergent.
HARVARD UNIVERSITY, 
CAMBRIDGE, MASS. 




THE GREATEST AND THE LEAST VARIATE UNDER 
GENERAL LAWS OF ERROR* 


BY 


EDWARD LEWIS DODD 


INTRODUCTION 


To fit frequency distributions, several functions or curves have been used,
most of which are generalizations of the so-called normal or Gaussian or
Laplacean probability function

$$\varphi(x) = \frac{1}{\sigma\sqrt{2\pi}}\, e^{-x^{2}/2\sigma^{2}}.$$

The differential equation satisfied by this function was generalized by Karl
Pearson.† Gram,‡ Charlier,§ and Bruns‖ used the normal function and its
successive derivatives, with constant coefficients, to form a series, of which,
in practice, only a few terms are used. Jorgensen¶ developed a logarithmic
transformation, in which x is replaced by log x. Associated with the Law of
Small Numbers is the Poisson exponential function $e^{-\lambda}\lambda^{x}/x!$ for which
Bortkiewicz** gave a four-place table, and Soper†† a six-place table. The Charlier‡‡


* Presented to the Society, April 28, 1923. The word variate will refer to any of the
particular values which a variable may take on; e. g., the height of some specified soldier
in a regiment; the greatest variate here would be the height of the tallest soldier in
the regiment.

† Contributions to the mathematical theory of evolution, II: Skew variation in
homogeneous material, Philosophical Transactions A, vol. 186 (1895), part I, pp. 343-414.

‡ Über die Entwickelung reeller Funktionen in Reihen mittelst der Methode der kleinsten
Quadrate, Journal für die reine und angewandte Mathematik, vol. 94, pp. 41-73.

§ Über die Darstellung willkürlicher Funktionen, Arkiv för Matematik, Astronomi
och Fysik, vol. 2, number 20.

‖ Über die Darstellung von Fehlergesetzen, Astronomische Nachrichten, vol. 143.

¶ See Arne Fisher, The Mathematical Theory of Probabilities, I (2d edition), pp. 236-260.

** Das Gesetz der Kleinen Zahlen, 1898. See Arne Fisher, loc. cit., p. 266.

†† Pearson’s Tables for Statisticians and Biometricians, pp. 113-121.

‡‡ Meddelanden från Lunds Observatorium, 1905. Vorlesungen über die Grundzüge
der Mathematischen Statistik, p. 6, 79-85. See also Arkiv, loc. cit.






B-Series, for integral variates, makes use of the Poisson function and its
differences. The Makeham* life function is well known in life insurance.

Dealing only with the normal function itself, Bortkiewicz† determined
mean and modal values for the interval of variation, i. e., the difference
between the greatest and the least of n variates. For this same problem there
remains to be considered the median and the asymptotic value of the interval
of variation. The asymptotic value is a function of n which, with a probability
converging to certainty, gives the interval of variation with a relative error
small at pleasure. To make the problem broader, we shall consider the greatest
and the least variate individually, and shall set up six general classes of
functions which include as special cases the frequency functions in common use.

These six classes of functions are distinguished as follows. Apart from
a factor ψ(x) satisfying certain inequalities, the probability-function for large
values of x is, respectively,

$$(1)\ 0; \quad (2)\ x^{-1-\alpha}; \quad (3)\ g^{x^{\alpha}}; \quad (4)\ g^{(\log_c x)^{\gamma}}; \quad (5)\ g^{c^{x}}; \quad (6)\ x^{-x};$$

with α > 0, γ > 1, 0 < g < 1, c > 1. The first represents a finite interval;
the second is involved in Pearson types; the third, with α = 2, is the normal
probability function; the fourth leads to logarithmically transformed functions;
the fifth, to the Makeham life function; the sixth, to the Poisson exponential
function.


1. DEFINITIONS 


Definition 1: Probability function. The function φ(x) is a probability
function for specified variates, if for each variate

$$\Phi(x) = \int_x^{\infty} \varphi(x)\,dx$$

is the probability that the variate will take on a value equal to or greater
than x.

Even when the statistical material must be given in integers, it is customary
to think of φ(x) and Φ(x) as continuous, especially when the number
of variates is large. When it is desirable to provide at the same time for

* Journal of the Institute of Actuaries, 1860. See Institute of Actuaries’ Text
Book, Part II, Chap. VI.

† Variationsbreite und mittlerer Fehler, Sitzungsberichte der Berliner
Mathematischen Gesellschaft, Jahrgang 21, Sitzung am 26. Oktober 1921.


| 




continuous and discontinuous probability, the Stieltjes integral* may be 
used. 

Definition 2: Asymptotic certainty. An “event” dependent upon n variables
or variates is asymptotically certain if for any positive η, small at pleasure,
it is possible to determine an n′ so that when n > n′, the probability that the
event will happen is greater than 1 − η.


2. THE ASYMPTOTIC VALUE OF THE GREATEST VARIATE 


THEOREM I. If the probability function φ(x) = 0, for x > x₂, and if
$\int_x^{x_2} \varphi(x)\,dx \neq 0$ when x < x₂, then it is asymptotically certain that the greatest
of n variates will differ from x₂ by less than any preassigned positive ε.

Proof. By hypothesis,

$$\int_{x_2-\epsilon}^{x_2} \varphi(x)\,dx = \delta > 0.$$

Then the probability that all the n variates will be less than x₂ − ε is (1 − δ)ⁿ,
which approaches zero with increasing n.

A similar statement can be made for the least variate. In fact, in all the
theorems which follow the treatment of the least variate will be omitted as
obvious. Of course, x will often be replaced here by |x|.
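
To see how Definition 2 is met in the situation of Theorem I, one may compute the least n for which (1 − δ)ⁿ falls below a prescribed η. The following is a minimal sketch; the function name `smallest_n` and the sample values of δ and η are ours.

```python
import math


def smallest_n(delta, eta):
    """Least n with (1 - delta)**n < eta, for 0 < delta < 1 and 0 < eta < 1."""
    n = max(1, math.ceil(math.log(eta) / math.log(1 - delta)))
    # Guard against rounding in the logarithms on either side.
    while (1 - delta) ** n >= eta:
        n += 1
    while n > 1 and (1 - delta) ** (n - 1) < eta:
        n -= 1
    return n


print(smallest_n(0.1, 0.01))
```

For every larger n the probability that all variates fall short of x₂ − ε is then less than η.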

THEOREM II. If, for positive x, the probability function is†

$$\varphi(x) = x^{-1-\alpha}\,\psi(x),$$

with α, k₁, k₂ positive constants, and k₁ < ψ(x) < k₂, then it is asymptotically
certain that the greatest of n variates will be

$$n^{(1+\epsilon')/\alpha},$$

where |ε′| < ε, small at pleasure.


* See R. von Mises, Fundamentalsätze der Wahrscheinlichkeitsrechnung, Mathematische
Zeitschrift, vol. 4 (1919), pp. 1-97.

† Here, and in the following theorems, if φ(x) has the indicated form merely when x is
greater than some given constant, the theorem remains valid.






Proof. By hypothesis,

$$\int_x^{\infty} \varphi(x)\,dx < k_2 \int_x^{\infty} x^{-1-\alpha}\,dx = \frac{k_2}{\alpha x^{\alpha}}.$$

Hence, if with ε > 0 small at pleasure, we set $x = n^{(1+\epsilon)/\alpha}$,
it follows that

$$\int_x^{\infty} \varphi(x)\,dx < \frac{k_2}{\alpha\, n^{1+\epsilon}}.$$

Thus, the probability that a specified variate will be less than this x is
greater than

$$1 - \frac{k_2}{\alpha\, n^{1+\epsilon}}.$$

And the probability that all variates will be less than this x is greater than

$$\left(1 - \frac{k_2}{\alpha\, n^{1+\epsilon}}\right)^{n}.$$

But this approaches unity as n approaches infinity. And thus, with η > 0
small at pleasure, it is possible to find n′ so that if n > n′, the probability
that all variates will be less than $n^{(1+\epsilon)/\alpha}$ is greater than 1 − ½η.

Similarly, using $x = n^{(1-\epsilon)/\alpha}$, it can be shown that the probability
that all variates will be less than $n^{(1-\epsilon)/\alpha}$ is less than ½η for n greater
than some n″.

Thus, for large enough n, the greatest variate will lie in the interval
from $n^{(1-\epsilon)/\alpha}$ to $n^{(1+\epsilon)/\alpha}$ unless all variates are less than $n^{(1-\epsilon)/\alpha}$, for
which the probability is less than ½η, or unless some variate surpasses
$n^{(1+\epsilon)/\alpha}$, for which the probability is likewise less than ½η. Hence, by
Definition 2, it is asymptotically certain that the greatest variate will lie in
the interval from $n^{(1-\epsilon)/\alpha}$ to $n^{(1+\epsilon)/\alpha}$.
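
The two probabilities used in this proof can be evaluated exactly in the simplest instance. The sketch below assumes a pure power law, for which the probability of a variate equal to or greater than x is x^(−α) for x ≥ 1 (that is, ψ constant); the function name `prob_all_below` and the sample values are ours.

```python
import math


def prob_all_below(n, alpha, factor):
    """Probability that all n variates fall below n**(factor/alpha).

    Assumes the tail probability x**(-alpha) for x >= 1, a special case
    chosen only for illustration.
    """
    x = n ** (factor / alpha)
    tail = x ** (-alpha)                      # chance that one variate reaches x
    return math.exp(n * math.log1p(-tail))    # (1 - tail)**n, computed stably


n, alpha, eps = 10**6, 2.0, 0.2
print(prob_all_below(n, alpha, 1 + eps))   # should be near 1
print(prob_all_below(n, alpha, 1 - eps))   # should be near 0
```

With ε fixed, the first probability tends to unity and the second to zero as n increases, which is the content of the two halves of the proof.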
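THEOREM III. If the probability function is

$$\varphi(x) = g^{x^{\alpha}}\cdot\psi(x), \quad\text{with}\quad x^{-\beta} < \psi(x) < x^{\beta},$$

where α, β, g are positive constants, and g < 1, then it is asymptotically certain
that the greatest of n variates will equal

$$(-\log_g n)^{1/\alpha}\,(1+\epsilon'), \quad\text{with}\quad |\epsilon'| < \epsilon,$$

small at pleasure.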
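Proof. Let

$$I = \int_x^{\infty} g^{t^{\alpha}}\, t^{\beta}\,dt.$$

Then, integrating by parts,

$$I = \frac{1}{\alpha(-\log_e g)}\left[x^{\beta-\alpha+1}\, g^{x^{\alpha}} + (\beta-\alpha+1)\int_x^{\infty} g^{t^{\alpha}}\, t^{\beta-\alpha}\,dt\right], \qquad \frac{1}{\alpha(-\log_e g)} > 0.$$

Thus,

$$I < \frac{x^{\beta-\alpha+1}\, g^{x^{\alpha}}}{\alpha(-\log_e g)},$$

provided β − α + 1 < 0. But, even if β − α + 1 > 0, the process performed
k times will put into the numerator β − kα + 1, which is ultimately negative.
Hence

$$I < g^{x^{\alpha}}\cdot F(x),$$

where F(x) is a sum of powers of x with constant coefficients.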


x 




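Suppose, now, that

$$x = (-\log_g n)^{1/\alpha}\,(1+\epsilon),$$

ε > 0, small at pleasure. Then,

$$g^{x^{\alpha}} = n^{-(1+2\epsilon_1)},$$

where 1 + 2ε₁ = (1 + ε)^α, ε₁ > 0. But, for large enough n, $F(x) < n^{\epsilon_1}$.
Then

$$I < \frac{1}{n^{1+\epsilon_1}}.$$

Hence, the probability that the n variates will all be less than x is greater
than

$$\left(1 - \frac{1}{n^{1+\epsilon_1}}\right)^{n} > 1 - \tfrac{1}{2}\eta$$

for n > some n′.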
Now, let J be the result of replacing 8 by —A in J. If, after integrating by 
parts, — 8—a+1>0, then 


But if —8—a-+1<0, a second integration by parts yields —8—2e+1, 
which is again negative. By combining the two results, 


where G (a) is a sum of powers of x with constant coefficients. If, now, 


x = (—logyn)/*(1—e), 
then for large n, 


a 1 
1 n 
>1—47 




where 1 — 2¢, = (1— «)*, ¢ >0. Thus, the probability that the variates 
will all be less than z is less than 


1 n 
when x > some n”. 
THEOREM IV. If the probability function is

$$\varphi(x) = g^{(\log_c x)^{\gamma}}\cdot\psi(x),$$

with $\psi(x) < x^{\beta}$, where c, g, β, γ are positive constants, 0 < g < 1, c > 1,
γ > 1, then it is asymptotically certain that the greatest of n variates will equal

$$c^{(-\log_g n)^{(1+\epsilon')/\gamma}},$$

with |ε′| < ε, small at pleasure.

Proof. The proof follows the same general course as in the preceding 
theorem, with 


oo 
v= I = {oe dt, J = f vr Par, 
zx x 


Upon integrating J by parts, the new integral contains the same integrand 
multiplied by a factor which is increased if the negative portion is dropped, 
and ¢ is replaced by x. Thus 


T<&(x)+ 


where, indeed, ¢(a)<1 when ~z is large and where the principal factors 
of &(x) are g Bex and powers of x. As for J, we may first take 8 > 1, and 
again in the new factor set ¢ = a. 

Now, setting 


nm nm 


it follows that 


(logex == —mloggn, (loge x)’+ rlogygx —~m’ loggn. 




By division, 


tlogex-logge 
(logex)” m 
Hence, since 7 > 1, 
m 
lim — = 
r—>o 


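Thus, asymptotically, the effect of $x^{\beta}$ disappears when combined with $g^{(\log_c x)^{\gamma}}$.
Now set

$$x = c^{(-\log_g n)^{(1+\epsilon)/\gamma}}.$$

Then

$$(\log_c x)^{\gamma} = (-\log_g n)^{1+\epsilon} > (1+\epsilon)(-\log_g n),$$

provided n is large enough. Thus, since g < 1,

$$g^{(\log_c x)^{\gamma}} < \frac{1}{n^{1+\epsilon}}.$$

With ε₁ suitably chosen, positive and less than ε, the probability that all
variates will be less than x is greater than

$$\left(1 - \frac{1}{n^{1+\epsilon_1}}\right)^{n},$$

which approaches unity as a limit.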
THEOREM V. If the probability function is

$$\varphi(x) = g^{c^{x}}\cdot\psi(x), \quad\text{with}\quad \psi(x) < b^{x},$$

where b, c, and g are constants, 0 < g < 1, b > 1, c > 1, then it is
asymptotically certain that the greatest of n variates will equal

$$[\log_c(-\log_g n)]\,(1+\epsilon'), \quad\text{with}\quad |\epsilon'| < \epsilon,$$

small at pleasure.


=r 


Proof. The proof follows the same general course as in Theorem III, with 


= vidi, J= | vb-dt, b = =. 


r xr 


Then, upon integrating by parts, we find 


WF 
loge) 


b 
At all events, xz < 1, for large enough k, so that by repetitions of the process, 


I<: F(x), 


where F(x) is a sum of terms such as $b^{x}$ with constant coefficients. Likewise J
can be proved greater than a similar expression, noting that log (bc) > 0. By
first setting x = [log_c(−log_g n)](1 + ε), and then x = [log_c(−log_g n)](1 − ε),
and noting that for large n, $(-\log_g n) < n^{\delta}$, δ > 0, small at pleasure, the
required inequalities can be obtained.

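THEOREM VI. If the probability function is

$$\varphi(x) = x^{-x}\cdot\psi(x), \quad\text{with}\quad \psi(x) < b^{x}, \quad\text{constant } b > 1,$$

then it is asymptotically certain that the greatest of n variates will be

$$X(1+\epsilon'), \quad\text{where}\quad X^{X} = n, \quad\text{with}\quad |\epsilon'| < \epsilon,$$

small at pleasure.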
Proof. Set 


I = f = f 


Integrating by parts, dropping a negative term, replacing ¢ by x, we obtain 


1 


I< b® + Iz log b, where z = 


. 
zx 




Likewise, 
J > zx-* b-* — {zlogb+ 222} J. 
dz dy , 
Moreover, if = and = n¥ , then asymptotically — =y. 


That is, a percentage error in y is controlled by an equal percentage error
in x; and $b^{x}$, when combined with $x^{x}$, makes no asymptotic contribution to
the exponent of n.
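
The value X defined by X^X = n has no elementary closed form, but it is easily found numerically. The following is a minimal sketch using bisection on X log X = log n; the function name `solve_x_pow_x` and the tolerance are ours.

```python
import math


def solve_x_pow_x(n, tol=1e-12):
    """Solve X**X = n for X > 1, given n > 1, by bisection on X*log(X) = log(n)."""
    target = math.log(n)
    lo, hi = 1.0, 2.0
    while hi * math.log(hi) < target:     # enlarge the bracket until it contains X
        hi *= 2
    while hi - lo > tol * hi:
        mid = (lo + hi) / 2
        if mid * math.log(mid) < target:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2


for n in (27, 1000, 10**6):
    print(n, solve_x_pow_x(n))
```

The slow growth of X with n is apparent: X = 3 already gives n = 27, and a millionfold increase of n raises X only a few units.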


3. THE MEDIAN* VALUE OF THE GREATEST VARIATE 


The probability that every one of n variates will be less than G is, as is
well known,

$$\left[1 - \int_G^{\infty} \varphi(x)\,dx\right]^{n},$$

where φ(x) is the probability function. If, now, we determine G so that this
expression is equal to ½, then it is equally likely that the greatest variate
will or will not exceed G. This median value of the greatest variate is thus
obtained by finding G so that

$$\int_G^{\infty} \varphi(x)\,dx = 1 - 2^{-1/n}.$$


While a median, mean, or modal value for the greatest variate may be more
difficult to compute than the asymptotic value, in the foregoing theorems, the
former will, in general, have more significance in practical problems. However,
even here, the asymptotic value may be useful for a rough simple check.
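
Where the tail probability can be evaluated, the median G is found by solving the tail equation numerically. The sketch below uses bisection and assumes only that the tail is a decreasing function; the function name `median_of_greatest` and the power-law tail used for illustration are ours.

```python
def median_of_greatest(tail, n, lo, hi, iters=200):
    """Solve tail(G) = 1 - 2**(-1/n) by bisection.

    tail(x) is the probability that a variate is equal to or greater than x,
    assumed decreasing, with tail(lo) above and tail(hi) below the target.
    """
    target = 1.0 - 2.0 ** (-1.0 / n)
    for _ in range(iters):
        mid = (lo + hi) / 2
        if tail(mid) > target:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2


# Illustration with the tail x**(-2), x >= 1, where G is known in closed form.
G = median_of_greatest(lambda x: x ** -2.0, 1000, 1.0, 1e6)
print(G, (1 - 2 ** (-1 / 1000)) ** -0.5)
```

For this tail the bisection agrees with the closed form G = (1 − 2^(−1/n))^(−1/2), and (1 − tail(G))ⁿ returns ½ as required.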


4. THE NORMAL† PROBABILITY FUNCTION


If, in Theorem III, we set

$$\frac{1}{2\sigma^{2}} = -\log_e g, \qquad \alpha = 2,$$

then $g^{x^{\alpha}}$ becomes $e^{-x^{2}/2\sigma^{2}}$.


*In the theory of errors, the so-called “probable error” is the median of the absolute 
values of the errors. Thus, it is equally likely that an error, taken positively, will or will 
not exceed the probable error. 

+ Rietz, in his article Frequency distributions obtained by certain transformations of 
normally distributed variates, Annals of Mathematics, ser. 2, vol. 23 (1922), pp. 292-300, 




Hence, under the normal probability law, it is asymptotically certain that the
greatest variate will be, apart from the factor (1 + ε′),

$$(-\log_g n)^{1/2} = \sqrt{\frac{\log_e n}{-\log_e g}} = \sigma\sqrt{2\log_e n}.$$


These results hold also for a Gram series with but a finite number of terms, 
since the polynomial factor has no asymptotic influence. 

On account of the symmetry of the normal function, an average value for 
the variation interval is obtained by doubling the corresponding value for 
the greatest variate. The following table compares the median and asymp- 
totic values of the variation interval, computed by the formulas of this paper, 
with the modal, mean, and restricted mean (“bedingte ... mathematische 
Erwartung”) values obtained by Bortkiewicz.* Bortkiewicz, indeed, after 
noting that his mean determination is very close to the average of the other 
two, gives examples from anthropometry and roulette in which the actual 
variation is close to his mean value. 

Variation Interval 


1 
Values of —— for n variates subject to 
oV 2a 


Number of | Modal Median | Mean Restricted | Asymp- | Asymptotic 
variates value value value mean value totic oS 
n Bortkiewicz), (Dodd) ‘Bortkiewicz)|(Bortkiewicz)'! (Dodd) Median 
100 76 92 . 1.23 
1,000 As 1.16 

10,000 | 7. 58 1.13 
100,000 | 3.69 8 . 1.10 


The asymptotic value of the interval of variation is thus about 23% too 
large when n = 100, and is still 10% too large when n = 100,000. 
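
The two percentages just quoted can be recomputed. The sketch below works in units of σ and assumes, as stated above, that the interval is twice the value for the greatest variate: the median interval is twice the normal deviate whose upper tail is 1 − 2^(−1/n), and the asymptotic interval is 2√(2 logₑ n). The function name `interval_values` is ours.

```python
import math
from statistics import NormalDist


def interval_values(n):
    """Median and asymptotic variation intervals, in units of sigma, and their ratio."""
    # Upper tail 1 - 2**(-1/n) corresponds to cumulative probability 2**(-1/n).
    median = 2 * NormalDist().inv_cdf(2 ** (-1 / n))
    asymptotic = 2 * math.sqrt(2 * math.log(n))
    return median, asymptotic, asymptotic / median


for n in (100, 1_000, 10_000, 100_000):
    median, asymptotic, ratio = interval_values(n)
    print(f"{n:>7}  {median:6.2f}  {asymptotic:6.2f}  {ratio:5.2f}")
```

The ratio column should fall slowly from about 1.23 toward 1.10 over this range of n, in agreement with the last column of the table.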


considers in particular the transformation x” = kx", which would, for example, give the 
distribution of volumes of similar solids — “oranges” — if the ‘‘diameters” are normally 
distributed. In such a case as this, where x” is an increasing function of x, the asymptotic 
value of the greatest “volume” can be found by finding first the asymptotic value of the 
greatest ‘‘diameter”. 

*Loc. cit. See also Nordisk Statistisk Tidskrift, vol. 1, pp. 11-38. 






5. THE PEARSON FREQUENCY TYPES* 


The Pearson frequency types are given below with the asymptotic interval 
of variation for each, and the number of the theorem involved. 


Asymptotic interval of variation for the Pearson frequency types 


Type Frequency function Interval of variation Number 
| number of theorem | 
9 
va x 
2\va 
x 
yo (1-3) 2a I 
| 
loge n 
(1+ a+ I, 11 
a 
IV Yo (1 +. =) e tan \%/@) 2 mn} (2m m> 4 Il 
VI Yo x>a —q@, iI, Il 
— | 
VII 2 loge n | 


6. OTHER FREQUENCY FUNCTIONS 
1. Jorgensen function. The Jorgensen function is of the formt 


where k, &, and 6 are constants, and z>0. It can be written 


k' a? g 


(logax? 


where 


logeg = 262” = = constant. 


* Certain special and limiting cases have also been designated as “types”.
+See Arne Fisher, loc. cit., p. 241. 


1 
ke 2+ 4 
| 
| 




Then, by Theorem IV, the asymptotic value of the greatest variate is 


(1 +6), with 


small at pleasure. 
2. Poisson exponential function and Charlier B-curve. The Poisson 
function has the form 


x! 
But, by Stirling’s formula, 


For the asymptotic value of the greatest variate, the only significant factor
here is $x^{x}$. This asymptotic value, by Theorem VI, is

$$X(1+\epsilon'), \quad\text{where}\quad X^{X} = n, \quad |\epsilon'| < \epsilon,$$


small at pleasure. The Charlier B-curve for integral variates is obtained from 
the Poisson function by differencing. But 


= 1— I, 


and this bracket has no asymptotic significance. Hence, if only a finite number 
of terms are taken, the greatest variate, asymptotically, remains that deter- 
mined as above. 

3. Makeham life function. The Makeham formula for the number of
survivors at age x from an original group of l₀ individuals just born is

$$l_x = k\, s^{x}\, g^{c^{x}},$$

where k > 0, 0 < s < 1, 0 < g < 1, c > 1. Postulating a stable population
supported by the same number of births annually, and assuming that the 
theoretic relative frequency is the equivalent of probability, the following table, 


r 




based upon Theorem V, gives the age of the oldest individual, in accordance
with constants used in the American Experience Table, as makehamized by
Arthur Hunter,* in the Institute of Actuaries Table (H^M), and in the
McClintock Annuitant Tables, makehamized by W. M. Strong.*


Age of oldest of n individuals 


By asymptotic formula log, (— log, m), where ks” g°- 


\ For population of 


Makehamized mortality table 


lone thousand, one million | one billion 


oldest age | oldest age | oldest age 


American experience ............... | 95.1 101.7 | 105.5 
Institute of Actuaries............... | 96.3 103.9 108.3 
| McClintock-Male .................. | 97.8 105.3 109.7 
| McClintock-Female................. | 100.8 


108.3 112.7 

While these results are somewhat crude, it seems surprising that the asymp- 
totic formula which dispenses with the factor s* could do so well. The question, 
indeed, arises whether any graduation formula can throw much light upon 
extreme ages, because of the gross irregularities commonly found at the ends 
of biologic series. 
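
As a hedged illustration of the asymptotic formula $\log_c(-\log_g n)$, the sketch below evaluates it directly. The function name `oldest_age` is hypothetical, and no Makeham constants are supplied here because the text does not state them; any values of c and g passed in are the reader's own.

```python
import math


def oldest_age(n, c, g):
    """Asymptotic age of the oldest of n individuals, log_c(-log_g n).

    Hypothetical helper; requires c > 1, 0 < g < 1 and n > 1 as in the text.
    """
    if not (c > 1.0 and 0.0 < g < 1.0 and n > 1):
        raise ValueError("need c > 1, 0 < g < 1, n > 1")
    # -log_g n = ln n / (-ln g), which is positive because ln g < 0.
    minus_log_g_n = math.log(n) / (-math.log(g))
    return math.log(minus_log_g_n) / math.log(c)
```

One consequence worth noting: passing from a thousand to a million doubles $\log n$, so the formula raises the oldest age by exactly $\log_c 2$ whatever the value of g, and passing from a million to a billion raises it by $\log_c 1.5$.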


7. SUMMARY 


The interval of variation is the difference between the greatest and the
least of n variates in a distribution. Theorems are here given for the greatest
variate; corresponding theorems can be stated for the least variate, using $|x|$
in place of $x$ when necessary. In the following table which summarizes these
theorems, the letters stand for positive numbers; they are constants except
$n$ and $G$. Moreover, $g<1$; but $b>1$, $c>1$, $\gamma>1$. For each variate

$$\int_x^\infty \varphi(x)\,dx$$

is the probability that the variate will be equal to or greater than $x$.

With $\varphi(x) = \varphi_1(x)\cdot\psi(x)$, the two factors are each described below. As n
increases indefinitely, a probability converges to certainty that the greatest
variate will take on the stated asymptotic value, with a relative error small
at pleasure for the values in Classes I, III, V, and VI, and for $1/\alpha$ and $1/\gamma$ in
Classes II and IV.


*Transactions of the Actuarial Society of America, vol. 7, p. 200, p. 289. 


Asymptotic value of the greatest of n variates


When each variate is subject to $\varphi_1(x)\cdot\psi(x)$.*


Class    $\varphi_1(x)$    Conditions for $\psi(x)$    Greatest variate†    Applications


5 Pearson Types 


3 Pearson Types 


Gaussian Function 
Grams Series (finite) 


(—logy 


| Jorgensen Logarithmic 


| Function 


$\log_c(-\log_g n)$        Makeham Life Function


$G$, with $G^G = n$        Poisson Exponential; Charlier B-Series (finite)


Asymptotic values have a theoretic importance because of the rigidity of 
the determination. Possibly, they may be used unreservedly in problems 
where the variates are as numerous as atoms; but in most practical problems, 
their chief value would seem to be in furnishing a rough check upon mean, 
modal, or median values. The latter can be found by determining G so that

$$\int_G^\infty \varphi(x)\,dx = 1 - 2^{-1/n}.$$
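
The condition for the median can be read as follows, assuming the n variates are independent: the greatest variate falls below G exactly when all n variates do, so if p is the probability that one variate is at least G, then $(1-p)^n = 1/2$. The sketch below works this out for an illustrative law with tail probability $e^{-x}$ for $x \ge 0$; that law and both function names are assumptions made for the example, not taken from the paper.

```python
import math


def median_tail_probability(n):
    """Tail probability p = 1 - 2**(-1/n) that fixes the median G of the greatest of n variates."""
    # -expm1(-ln2/n) equals 1 - exp(-ln2/n) and stays accurate for large n.
    return -math.expm1(-math.log(2.0) / n)


def median_greatest_exponential(n):
    """Median of the greatest of n independent variates whose tail probability is exp(-x)."""
    # Solve exp(-G) = p for G.
    return -math.log(median_tail_probability(n))
```

For this illustrative law the result grows like $\log n$, which is the kind of rough check against the asymptotic values that the paragraph above has in mind.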


* Or merely subject when $x$ is sufficiently large.
† Apart from the factor $(1+\varepsilon')$, with $|\varepsilon'|<\varepsilon$, small at pleasure.


t Provided Je (x) + 0. 


UNIVERSITY OF TEXAS, 
AUSTIN, TEX. 




THE INTERSECTION NUMBERS* 
BY 


OSWALD VEBLEN 


1. In his first memoir on analysis situs† Poincaré defined a number
$N(\Gamma_k, \Gamma_{n-k})$ which had previously been considered, at least in special cases,
by Kronecker. With certain conventions as to sign this number represents
the excess of the number of positive over the number of negative intersections
of a k-dimensional circuit $\Gamma_k$ with an (n-k)-dimensional circuit $\Gamma_{n-k}$ when
both are immersed in an n-dimensional oriented manifold. The purpose of the 
present paper is to show how to calculate this number when the manifold is 
defined combinatorially as a collection of cells and the circuits are composed 
of sets of these cells; and to show how the matrices which represent the inter- 
sectional relations between the k-circuits and the (n —k)-circuits depend on 
the matrices of orientation of the manifold. We also define certain modulo 2 
intersection numbers and discuss the matrices connected with them. 

The terminology and notations of the Cambridge Colloquium Lectures on
Analysis Situs (New York, 1922) will be used without further explanation,
and the references not otherwise indicated will be to that book.


2. Let a manifold $M_n$ be given as the set of all points of a complex $C_n$.
Let $C'_n$ be a complex dual to $C_n$ constructed as explained on page 88 by means
of a complex $\bar{C}_n$ which is a regular subdivision both of $C_n$ and of $C'_n$. Every
k-cell $a_i^k$ of $C_n$ has a single point $P_i^k$ (cf. p. 85) in common with a single
(n-k)-cell of $C'_n$, which is called $b_i^{n-k}$. Our first problem will be to assign
a positive or negative sign to the intersection of $a_i^k$ with $b_i^{n-k}$.

In order to do this, we suppose $M_n$ to be oriented as explained in Chapter IV
and that all cells, circuits, etc., are oriented. Moreover, in the regular
complex $\bar{C}_n$, in which each i-cell is uniquely determined by its i+1 vertices,
the orientation of the i-cell will be denoted by the order in which its vertices
are written, and the following two conventions will be followed: (1) if
$A_0 A_1 \cdots A_k$ denotes a given oriented k-cell ($k = 1, 2, \ldots, n$) any even
permutation of $A_0 A_1 \cdots A_k$ denotes the same oriented k-cell and any odd
permutation denotes its negative; (2) the oriented (k-1)-cell $A_1 A_2 \cdots A_k$ is
positively related to the oriented k-cell $A_0 A_1 \cdots A_k$.
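
Convention (1) amounts to attaching to each ordering of the vertices the sign of the corresponding permutation. As a minimal sketch, assuming the vertices are labelled 0, 1, ..., k, the hypothetical helper `permutation_sign` below returns +1 for an even permutation and -1 for an odd one by counting the transpositions needed to sort the ordering.

```python
def permutation_sign(order):
    """Return +1 if `order` is an even permutation of 0..k, and -1 if it is odd."""
    order = list(order)
    if sorted(order) != list(range(len(order))):
        raise ValueError("order must be a permutation of 0..k")
    sign = 1
    for i in range(len(order)):
        # Swap the entry at position i into its home position until position i is correct;
        # each swap is one transposition and reverses the sign.
        while order[i] != i:
            j = order[i]
            order[i], order[j] = order[j], order[i]
            sign = -sign
    return sign
```

For a 2-cell with vertices labelled 0, 1, 2, the orderings (0, 1, 2) and (1, 2, 0) give the same oriented cell, while (1, 0, 2) gives its negative.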

* Presented to the Society under a different title, April 24, 1920.

† Journal de l’École Polytechnique, ser. 2, vol. 1 (1895).



A simple argument by mathematical induction could, but will not here, be 
given to prove that these notations and conventions are consistent with them- 
selves and with the definition of oriented cells. 

3. The k-cell $a_i^k$ of $C_n$ is made up of a number of k-cells of $\bar{C}_n$ having $P_i^k$ as
their common vertex. Using the notation of page 86, let one of these be
denoted by


0 1 k-1 k 
Ph... 


the points P being chosen, as is always possible, so that the orientation of
this k-cell agrees with that of $a_i^k$. In like manner, $b_i^{n-k}$ is made up of a number
of (n-k)-cells of $\bar{C}_n$ having $P_i^k$ as their common vertex, and we let any one
of these be denoted by


k Ke -1 
PF P| 


the points P being chosen this time so that the sense of the (n-k)-cell which they
represent agrees with that of $b_i^{n-k}$. According as the oriented n-cell


0 1 k-1 
P, P; P; 


is positively or negatively oriented, we say that the intersection of $a_i^k$ with
$b_i^{n-k}$ is positive or negative. In the first case we write

$$N(a_i^k, b_i^{n-k}) = 1$$

and in the second case

$$N(a_i^k, b_i^{n-k}) = -1.$$

From the definition of the points P it follows directly that this definition
is independent of the particular cells of $\bar{C}_n$ which it employs. It also follows
that the function N is such that

(3.1) $N(a_i^k, b_i^{n-k}) = -N(-a_i^k, b_i^{n-k}) = -N(a_i^k, -b_i^{n-k}).$

Since the relation between $C_n$ and $C'_n$ is reciprocal, the definition given here
determines the meaning of $N(b_i^{n-k}, a_i^k)$, and a simple count of transpositions
in the notation gives the formula

(3.2) $N(b_i^{n-k}, a_i^k) = (-1)^{k(n-k)} N(a_i^k, b_i^{n-k}).$

4. The cells of $C_n$ and $C'_n$ are so oriented (cf. p. 123) that
Ey = 


which means that at is positively or negatively related to a‘ according as 
v?-** is positively or negatively related to vy}, Now the points P may 
be so chosen that P; --- Pi * represents an oriented cell on a¥-! and 
P; PF represents an oriented cell on By the definition 
in § 2 above, the oriented cell P? P) --- Pi is positively or negatively 
related to Pi... Pf, and therefore to according as (—1)* is 
positive or negative. On the other hand, Pi’... is positively 
related to Pj**?... P;', and therefore to bi". Hence if bi’ is positively 
related to bi-*, is positively related to a* and (—1)* 
is positive or negative according as 

is positively or negatively oriented. A similar result holds if bj ae i 
negatively related to b/"". Hence 


$$N(a_i^k, b_i^{n-k}) = (-1)^k N(a_j^{k-1}, b_j^{n-k+1}).$$

By repeated application of this formula we obtain

$$N(a_i^k, b_i^{n-k}) = (-1)^{k(k+1)/2} N(a_j^0, b_j^n).$$

But all the n-cells $b_j^n$ are similarly oriented. Hence the value of $N(a_j^0, b_j^n)$
is the same for all zero cells $a_j^0$, and consequently the value of $N(a_j^k, b_j^{n-k})$
is independent of j. Hence if the notation is so chosen that $b_j^n$ is positively
oriented,*
N(a?, = 1, 


N(al, = —1, 
N (a?, = 1, 
N = —1, 


and all these equations are independent of j.


* Cf. Poincaré, Proceedings of the London Mathematical Society, vol. 32 (1900) 
p. 280. 



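5. An oriented complex $\Gamma_k$ composed of the oriented k-cells $a_1^k, a_2^k, \ldots, a_{\alpha_k}^k$
counted $x^1$ times, $x^2$ times, $\ldots$, $x^{\alpha_k}$ times, respectively, is represented by the
notation

(5.1) $\Gamma_k = \sum_{j=1}^{\alpha_k} x^j a_j^k.$

Let $\Gamma'_{n-k}$ be an arbitrary oriented complex of $C'_n$, so that

(5.2) $\Gamma'_{n-k} = \sum_{j=1}^{\alpha_k} y^j b_j^{n-k}.$

By the number of intersections of $\Gamma_k$ with $\Gamma'_{n-k}$, having regard to sign, we
shall mean the number $N(\Gamma_k, \Gamma'_{n-k})$ defined by means of the equation

(5.3) $N(\Gamma_k, \Gamma'_{n-k}) = \sum_{j=1}^{\alpha_k} x^j y^j N(a_j^k, b_j^{n-k}) = (-1)^{k(k+1)/2} \sum_{j=1}^{\alpha_k} x^j y^j.$

If we recall that there are no intersections of cells of $\Gamma_k$ of dimensionality
less than k with cells of $\Gamma'_{n-k}$ and that no cell $a_i^k$ intersects a cell $b_j^{n-k}$ unless
i = j, it is clear that this definition is in accordance with geometric intuition.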


6. The last equation has as obvious corollaries the equations 


from which it follows that if $\Gamma_k^i$ ($i = 1, 2, \ldots, \alpha_k$) is any set of k-dimensional
complexes on which all k-dimensional complexes of $C_n$ are linearly dependent
and $\Gamma'^{\,i}_{n-k}$ ($i = 1, 2, \ldots, \alpha_k$) a set of (n-k)-dimensional complexes on which
all (n-k)-dimensional complexes of $C'_n$ are linearly dependent, then if


(6.3) 


and 


(6.4) 


where the x’s and y’s are integers, then

(6.5) $N(\Gamma_k, \Gamma'_{n-k}) = \sum_{i=1}^{\alpha_k} \sum_{j=1}^{\alpha_k} x_i y_j \, N(\Gamma_k^i, \Gamma'^{\,j}_{n-k}).$


Hence the intersection numbers of all k-dimensional complexes with all
(n-k)-dimensional complexes depend on the matrix of numbers $N(\Gamma_k^i, \Gamma'^{\,j}_{n-k})$.
By choosing the complexes $\Gamma_k^i$ and $\Gamma'^{\,j}_{n-k}$ in the normal manner described in the
Colloquium Lectures this matrix may be given a very simple form, which we
shall determine in the next three sections.

7. As proved on page 116 of the Colloquium Lectures, a set of /-dimensional 
complexes upon which all the complexes formed from cells of C, are linearly 
dependent may be so chosen as to consist of (1) a set of Px—1 non-bounding 
circuits which we shall denote by (4 = 1,---, Px — 1), or in Poinearé’s 
notation, 


(7.1) 

(2) a set of 7); cireuits 4, (/ 1, .... 7) which satisfy the homologies 
70) i 

t; 4).~ 0 


in which ¢; represents a i-dimensional coefficient of torsion; (3) a set of rej1 
—r; bounding circuits ©, 


(7.3) Oj). 0: 


and (4) and (5) two sets of complexes @;, and Y, which are not circuits but 
satisfy the following congruences: 


in which ©); and 4,-; are defined by replacing k by /;—1 in (7.3) and (7.2). 
These relations are derived from the matrix equation 


which arises in reducing (cf. p. 108) the orientation matrix $E_k$ to normal form.
The matrix Ej; is one in which all elements are zero except the first 7 elements 
of the main diagonal. The first *—7,—1 of the non-zero elements are 1 and 
the remaining 7,1 are the coefficients of torsion of dimensionality k— 1. 

The first 7—7,-—1 columns of D; represent the complexes ®;, the next 7-1 
columns represent the complexes Wi, the next Py— 1 columns represent the 
circuits 7%, the next — columns represent the circuits O;, the next 7% 
columns represent the circuits 4. Thus, for example, if the jth column of 
Dy (0 <j < is (w1j, +5 We have 


The columns of the matrix C,—1 are the same as the columns of Dy—-1 in 
a different order, and each complex represented by a column of D, is bounded 
by the circuit represented by the corresponding column of the matrix Cy_1- Ei. 
It is from this fact that the congruences (7.4) and (7.5) are derived. The fact 
that Ty: 4k, Gj. are circuits is a consequence of the fact that all elements 
of L; subsequent to the 7th column are zero. 

The homologies (7.2) and (7.3) arise by similar reasoning from the matrix 
equation 


in which it is to be remembered that the columns of Cy are the same as those 
of D, in a different order. 

8. The (x —k)-dimensional complexes required in the formulas of § 6 may 
be determined by the same process as described in § 7, from the matrices of 
the dual complex C),. The matrices of the dual complex are related to those 
of C,, by the equation (ef. p. 123). 


(8.1) = 
in which Z,—, is the matrix of the relations between (n—k—1)-cells and 


(n—k)-cells of Ci, and Ej-41 is the matrix obtained by interchanging rows 
and columns of Fj.11. The equation (7.8) gives the following: 


v—1 
Ck: Diss = 1. 


y-1 
Ce 




The columns of C;,~ determine a linearly independent set of complexes 
analogous to those determined by the columns of D;. They are described by 
the following homologies and congruences, written in the order of the columns 


of : 

(8.5) 0, O<y 
(8.6) ti 44. 0, O<j 


9, Since the columns of D;, ave the same as those of C;, in a different order, 
and the columns of C, ~~ are the same as the rows of C; , the matrix equation 


(9.1) 1 
implies the relations 


10 if + p 


between the columns (#1j;, #2j, ..., a) of Dx and the columns 
(wip, ip, +++) Vay) of Cr*. But by (5.3) this implies that the intersection 
numbers of 41, ete., with ete., are zero except in the following 
);; cases, Written in the order of the columns of Cr?: 


(9.3) N (Qh, = S Tk} 
(9.5) = (—1k 0<j< Pe—1; 
(9.6) N(@j, Of.) = v2, O<j 


(9.7) N (Bk, = 0<j St%-1. 


‘ 




Thus, each k-circuit 7; intersects the corresponding (n—k)-circuit once 
and intersects no other of the fundamental (2 —)-dimensional complexes. 
None of the other k-circuits (A or ©) intersects any (2 —k)-circuits, but 
each ©; intersects a complex @,_; which is bounded by @,-x%-1; and each 
intersects a complex which is bounded by k-1 counted times. 
Thus we may say that each ©; links one and only one @,-;—-1 once and each 
A links one 4,-;-1 in a manner which may be described as a fractional 
number of times, +1/c%. A further study of these linkages would carry us 
beyond the bounds of the present paper. 

10. The matrix spoken of at the end of § 6 is now seen to consist entirely 
of zeros except for @, elements whose value, 1 in every case, is given by 
equations (9.3), ..., (9.7). If we limit attention to circuits the only non-zero 
terms which remain are those given by the intersections of ’i, ..., Tx’ > with 
the corresponding non-bounding (2 — k)-circuits. The matrix is therefore one 
which consists entirely of zeros except for the first P,—1 terms of the main 
diagonal which are all 1’s. For any k-circuit 1, of C, we have 


= «." 
(10.1) Dalit 
Sam 


and for any (2 —/;)-cireuit of we have 


P-1 


(10.2) = > D> 


= i=1 
When these expressions are substituted in (6.5) there results 


P,-1 


i=1 
Thus we have the theorem that 7/ 


P-1 


Tk 
(10.4) > xi Ti+ yi 


*=1 
and 
P,-1 Tk 
& 


i=1 


then the intersection number of Tj; with Ty—x is given by (10.3). 


k—1— 
i 
> 
i=1 
= 
Ox. 




This theorem has the corollary that 
Ta-~) = 0 
if and only if at least one of the homologies 
plik ~ 0 or ~ 0 


is satisfied for some integer value of p or q. In other words, the statement 
pT ~ 0 is equivalent to the equation 


N (lz, = 


for the one circuit J), and all circuits ,-;.. 
From this it follows that if 7 is any k-circuit composed of cells of C,, and 
such that 


Ti; 
then 


11. Incidentally it may be remarked that (10.4) and (10.5) give rise to the 
following “homologies with division allowed”: 


xr Ty, > yi 
§==} 


Whenever these homologies are satisfied the equation (10.3) is satisfied. As 
remarked by Poincaré, it is because the intersection numbers are more closely 
related to the homologies with division allowed than to the ordinary homo- 
logies that his attempt to prove the Euler theorem and the theorem about 
the duality of the Betti numbers by means of the intersection numbers was 
unsuccessful. 




12. The fundamental sets of circuits which appear in the formulas of § 10
are chosen in a very special manner. A perfectly arbitrary fundamental set
of k-circuits is however related to this special set by homologies


in which the (P;,—1-+ rx)-rowed determinant — +1. A general funda- 
vi 


mental set of (#2 — k)-circnits 7,7; is related to the special set by an analogous 
set of homologies. Hence the matrix of the intersection numbers 


N(Fi, 


is one of P);—1-+-7, rows and P;,—1-+ 7~1 columns, of rank P;,—1 and 
having all its invariant factors unity. 

13. For some purposes it is desirable to introduce intersection numbers
which do not distinguish between positive and negative intersections. The
theory of these numbers is much simpler than that which we have been developing
because all the determinations of algebraic sign in §§ 2, 3, 4, 5 can be
omitted. We simply replace the definitions of § 3 by the agreement that

$$N(a_i^k, b_j^{n-k}) = 1 \text{ or } 0$$

according as $a_i^k$ and $b_j^{n-k}$ have a common point or not. Then the definition in
§ 5 is replaced by

$$N(\Gamma_k, \Gamma'_{n-k}) = \sum_{j=1}^{\alpha_k} x^j y^j,$$

the sum being taken modulo 2.
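
As a small sketch of this modulo 2 definition, suppose a k-dimensional complex is recorded by the list of coefficients with which the k-cells enter it, and an (n-k)-dimensional complex of the dual by the coefficients of the dual cells in the same order. The helper `intersection_number_mod2` is a hypothetical name introduced here; it relies only on the fact, stated in § 5, that a cell meets no dual cell except the one with the same index.

```python
def intersection_number_mod2(x, y):
    """Sum of x[j] * y[j] taken modulo 2, for coefficient lists of equal length."""
    if len(x) != len(y):
        raise ValueError("coefficient lists must have equal length")
    # Each index j contributes 1 exactly when both coefficients are odd.
    return sum((xj % 2) * (yj % 2) for xj, yj in zip(x, y)) % 2
```

With coefficients (1, 0, 1) and (1, 1, 1) the two complexes share two pairs of dual cells, so the number is 0; with (1, 0, 1) and (1, 1, 0) they share one pair and the number is 1.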

The determination of the intersection numbers of fundamental sets of
k-circuits and (n-k)-circuits in §§ 7, 8, 9 is replaced by an analogous theory
based on the matrices $A_{k-1}$ and $B_k$ which arise in the reduction of the incidence
matrix $H_k$ to normal form (cf. p. 79 and following pages). The result
obtained is that there exist a set of k-circuits Ti: Ti, ..., Ti and a set of 


a 


j — 1 f 


P,-1 P,-1+1, 
a; Tj, +- AN, 
j=1 j=P, 


550 OSWALD VEBLEN 


and if 


then 


== i Yi (mod 2). 


It should be observed that these formulas cannot be obtained by reducing 
the formulas of § 10, modulo 2, because the formulas of the present section 
take account of non-orientable circuits which do not enter into the theory of 
oriented intersections. 

PRINCETON UNIVERSITY, 

PRINCETON, N. J. 




THE GEOMETRY OF PATHS* 


BY 


OSWALD VEBLEN AND TRACY YERKES THOMAS 


1. Introduction. The first part of this paper is intended as a systematic
general account of the geometry of paths and is largely based on the series
of notes by Eisenhart and Veblen in volume 8 of the Proceedings of the
National Academy of Sciences. The general theory is carried far
enough to include an account of a series of tensors defined by means of normal
coördinates, and also a series of generalizations of the operation of covariant
differentiation. We then turn to a special problem, the investigation of the
conditions which must be satisfied by the functions $\Gamma$ in order that the
differential equations of the paths shall possess homogeneous first integrals.† We
first solve a still more special problem for first integrals of the nth degree (§ 15).
This includes as a special case the problem solved by Eisenhart and Veblen in
the Proceedings of the National Academy of Sciences, vol. 8 (1922),
p. 19, of finding the conditions which must be satisfied by the $\Gamma$’s in order
that the equations (2.1) shall be the differential equations of the geodesics of
a Riemann space.

Finally we solve the general problem for the linear and quadratic cases;
that is to say, we find algebraic necessary and sufficient conditions on the
functions $\Gamma$ in order that (2.1) shall possess homogeneous linear and quadratic
first integrals. The method used will generalize to homogeneous first integrals
of the nth degree. We leave unsolved all the projective problems which correspond
to the affine problems which we have solved. For example, the problem
remains open to find what condition must be satisfied by the $\Gamma$’s in order that
one of the sets of differential equations which define the same paths as (2.1)
shall have a linear first integral.

2. The geometry of paths. Consider an n-dimensional region the points
of which can be represented by coördinates $(x^1, x^2, \ldots, x^n)$. Also consider
a set of differential equations

(2.1) $\dfrac{d^2 x^i}{ds^2} + \Gamma^i_{\alpha\beta}\,\dfrac{dx^\alpha}{ds}\,\dfrac{dx^\beta}{ds} = 0$

in which

(2.2) $\Gamma^i_{\alpha\beta} = \Gamma^i_{\beta\alpha}.$

* Presented to the Society, October 28, 1922, and April 28, 1923.

† Our problem is distinguished from the problem of the existence of first integrals in
dynamical systems (studied by Staeckel, Painlevé, Levi-Civita, and others) by the fact that
the dynamical problem presupposes the existence of the integral corresponding to the
fundamental quadratic form. Cf. Ricci and Levi-Civita, Méthodes de calcul différentiel
absolu, Mathematische Annalen, vol. 54 (1901), p. 125.


In these expressions the subscripts and superscripts take all integer values
from 1 to n and the convention is employed that any term which contains the
same index twice, once as a subscript and once as a superscript, represents
a summation with respect to every such index. Thus the second term represents
a quadratic form in $dx^i/ds$. The coefficients $\Gamma^i_{\alpha\beta}$ are arbitrary analytic functions
of $(x^1, x^2, \ldots, x^n)$. The condition (2.2) is no restriction on the differential
equations (2.1) because the coefficients of any quadratic form can be written
so as to satisfy (2.2).
Any curve
(2.3) $x^i = \psi^i(s)$

which satisfies (2.1) is called a path and the theory of these paths is what we 
call the geometry of paths. 

The geometry of paths is a natural generalization of the euclidean geometry.
For the differential equations of the straight lines in an n-dimensional euclidean
space are

(2.4) $\dfrac{d^2 x^i}{ds^2} = 0$

when referred to a cartesian coördinate system. An arbitrary transformation
of the coördinates

(2.5) $x^i = g^i(y^1, y^2, \ldots, y^n)$

transforms (2.4) into a set of differential equations of the form (2.1) in the
variables y, in which

(2.6) $\Gamma^i_{jk} = \dfrac{\partial y^i}{\partial x^\alpha}\,\dfrac{\partial^2 x^\alpha}{\partial y^j\,\partial y^k}.$

Hence the system of paths defined by (2.1) has the properties of the straight
lines of euclidean space whenever the functions $\Gamma$ are such that (2.1) can be
transformed by an analytic transformation into (2.4). This transformation is
possible if and only if

ar 


as can easily be proved. The left member of this equation is denoted by $B^i_{jkl}$,
and is called the curvature tensor.
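
The remark that a change of coördinates turns the straight lines (2.4) into paths of the form (2.1) can be checked numerically in a simple case. The sketch below is an illustration under stated assumptions, not part of the paper: it takes polar coördinates $(r, \theta)$ in the euclidean plane, for which the non-zero coefficients are the standard values $\Gamma^r_{\theta\theta} = -r$ and $\Gamma^\theta_{r\theta} = \Gamma^\theta_{\theta r} = 1/r$, and integrates (2.1) by a classical fourth-order Runge-Kutta step. The function names are hypothetical.

```python
import math


def path_rhs(state):
    """Right-hand side of (2.1) in polar coordinates, as a first-order system.

    state = (r, theta, dr/ds, dtheta/ds). The coefficients used are the usual
    polar-coordinate values (an assumption of this sketch).
    """
    r, th, dr, dth = state
    d2r = r * dth * dth           # -Gamma^r_{theta theta} * (dtheta/ds)^2
    d2th = -2.0 * dr * dth / r    # -2 * Gamma^theta_{r theta} * (dr/ds) * (dtheta/ds)
    return (dr, dth, d2r, d2th)


def _shift(state, slope, h):
    """Return state + h * slope, componentwise."""
    return tuple(x + h * k for x, k in zip(state, slope))


def integrate_path(state, s_end, steps=2000):
    """Integrate the path equations from s = 0 to s = s_end with RK4."""
    h = s_end / steps
    for _ in range(steps):
        k1 = path_rhs(state)
        k2 = path_rhs(_shift(state, k1, 0.5 * h))
        k3 = path_rhs(_shift(state, k2, 0.5 * h))
        k4 = path_rhs(_shift(state, k3, h))
        state = tuple(
            x + h * (a + 2.0 * b + 2.0 * c + d) / 6.0
            for x, a, b, c, d in zip(state, k1, k2, k3, k4)
        )
    return state
```

Starting from $r = 1$, $\theta = 0$ with $dr/ds = 0$, $d\theta/ds = 1$, the path should be the vertical straight line through the cartesian point (1, 0), so that after $s = 1$ the cartesian coördinates $r\cos\theta$ and $r\sin\theta$ are both close to 1.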

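The paths defined by (2.1) are the geodesics of a Riemann space in case
the $\Gamma$’s are such that there exists a quadratic differential form

(2.8) $ds^2 = g_{\alpha\beta}\,dx^\alpha\,dx^\beta$

such that

(2.9) $\dfrac{\partial g_{ij}}{\partial x^k} - g_{i\alpha}\Gamma^\alpha_{jk} - g_{j\alpha}\Gamma^\alpha_{ik} = 0.$

In this case the paths are the geodesics of the differential form (2.8).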

The geometry of paths reduces to a Weyl metric geometry if the $\Gamma$’s are
such that there exists a linear form $\varphi_\alpha\,dx^\alpha$ and a quadratic form $g_{\alpha\beta}\,dx^\alpha\,dx^\beta$
such that

; 


1 
(2.10) Gor Vij 9 


(cf. H. Weyl, Raum, Zeit, Materie, 4th edition, p. 113).

In the general case (no restriction on the $\Gamma$’s except (2.2)) the geometry
of paths is equivalent to the geometry of infinitesimal parallelism as developed
by Weyl, in Raum, Zeit, Materie (4th edition, p. 100). For any system of $\Gamma$’s
which appear in the differential equations (2.1) can be used to establish
a definition of infinitesimal parallelism according to which the paths defined
by (2.1) are geodesics in the sense of Weyl.

3. Transformation of the dependent variables. Consider an arbitrary 
analytic transformation of the coérdinates 


which may also be written 

(3.2) f(z", z*,..., 2"). 
By substituting (2.3) in (3.1) we obtain 

(3.3) xt = Ysi(g) 


as the equation of the path represented by (2.3). Since 


(3.4) 


dat da° dx“ 
ds ads 
39 


Hd4 O. VEBLEN AND T. Y. THOMAS | October 


and 
(35 at dz” dz’ dx“ 
ds* ds ds 
we find that this path satisfies the differential equation 
, r dx” ax? 0 
») ds*® “Pods ds 
in which 
Out 0? a da? dar 


Thus the form of the equation (2.1) persists under a transformation of
coördinates. It follows from (3.7) that the functions $\Gamma$ behave like the components
of a tensor under linear transformations with constant coefficients
but not under more general transformations. It is seen by an easy computation
that the functions $B^i_{jkl}$ defined in § 2 are the components of a tensor. It follows
at once that the equation


ast 
(3.8) Sa = Bur = 


defines a tensor which is skew symmetric. This tensor is identically zero in 
the Riemann geometry. It also follows that 


0 ol 


/ jk — ji Deck 


(3.9) Rx = Bixi 


is a tensor. This we shall call the Ricci tensor because it reduces to the 
tensor studied by Ricci* for the case of the Riemann geometry. It is symmetric 
if and only if Sj; = 0, as is obvious on comparing (3.8) and (3.9). Further 
properties of these tensors are to be found in a paper by Eisenhart in the 
Annals of Mathematics (vol. 24). 

For convenience of reference we put down here the following formulas 
about transformation of coérdinates in general: . 


* G. Ricci, Atti, Reale Istituto Veneto, vol. 63 (1903), pp. 1233-1239.


| 


THE GEOMETRY OF PATHS 


aa aah Oa Ox! 


(3.13) 


Oa! 


0 


dar 


Ox"? 
(3.15) 


4! 


aac Ox! Ori 


ax? 
ork 
(3.16) 


= 


4. Transformation of the independent variable. If we make an 
arbitrary analytic substitution 
(4.1) s = f(t), 


(4.2) $t = g(s)$


1923] 
i 
(3.10) == gj; 
okt 
3 a Ox! Oak 0 
27 
0: 
(3.14) 
07a Oat 07a" or! bar 0 
ork oan jah Ox! 


5D6 9. VEBLEN AND T. Y. THOMAS | October 


in the equation of a path (2.3) the latter becomes 


(4.3) a’ = 
For this path we have


(4.4) dst lf? dt 


ds 


On comparison with (2.1) we see that the equation (4.3) satisfies the differen- 
tial equation 


(4.5) df dt dt ds* 
da’ dt \? 
dt | ds 


Hence the differential equations 


dat da“ 4 da” da? 

d@ dt at ' dt at 
(4.6) 
dat dat 
dt dt 


are satisfied by the equations of the paths and are such that they continue to
be satisfied if the independent variable in the parameter representation (4.3)
of any path is subjected to an arbitrary transformation.

From (4.5) it is evident that the differential equations (2.1) will continue to
be satisfied if the independent variable in the equations of a path (2.3) be
replaced by t where
(4.7) $t = as + b,$


a and b being constants.

The differential equations (4.6) are due to J. L. Synge, who has pointed out
that the system of paths defined by them is no more general than that defined
by (2.1). For, suppose that (4.3) satisfies (4.6). Let $\Phi(t)$ be the function of t
obtained by substituting (4.3) in any of the expressions whose equality is
asserted by (4.6). The following equation is satisfied by (4.3):


x! 4 ri da? 
dt “8 dt dt 
dxt 


dt 


(4.8) M(t). 


1923] THE GEOMETRY OF PATHS 
Substitute (4.2) in (4.3), obtaining 


an equation of the paths which must satisfy 


da® da? 
ds 
dt 


(4.9) 


Now if 


(4.10) s S(t) = A+ B 


A and B being constants, (4.9) reduces to (2.1). Hence the equations of any 
path defined by (4.6) may be written as solutions of (2.1). 

5. Projective geometry of paths.* Let us inquire under what circumstances
a set of differential equations


6.1) ds* 


can represent the same system of paths as (2.1). Suppose that a curve
(5.2) $x^i = g^i(t)$


is a path both for (5.1) and for (2.1). The functions $g^i(t)$ are not necessarily
solutions of (2.1) or of (5.1), but they are solutions of Synge’s equations (4.6)
and also of the corresponding equations determined by (5.1), i. e. of


dad dat dr? dat da” da? 


(5.8) dt ae dt dt dt \ dé + dt dt 


*The discovery that the same system of paths arises from (5.1) as from (2.1) when (5.5) 
and (5.8) are satisfied is due to Weyl, Géttinger Nachrichten, 1921, p. 99. See also 
Eisenhart, Proceedings of the National Academy of Sciences, vol. 8 (1922), p. 233, 
and Veblen, ibid., p. 347. In the latter paper in equation (2.6) the final ¢ should be omitted 
and dx'/dt,...,dx"/dt should be evaluated at the point q; also the integration signs are 
missing in (4.2). 


a j s 
ds* df? 
dat ds 
ds | dt 
da® da? 0 
“Beads ds 


558 0. VEBLEN AND T. Y. THOMAS [October 


Between (5.3) and (4.6) we can eliminate the second derivatives, thus obtaining 


daf da? (Meg da® dat 


(5.4) _ om: 
da! dt dt dao dt 
dt dt 

Let 

(5.5) Tag — Aug = 

and 

(5.6) 

o. 1 ip pe 


If we subtract from (3.7) the corresponding equations for the functions $\Lambda$ the
result shows that $\Phi^i_{\alpha\beta}$ is a tensor. Hence $\Phi_\alpha$ is a vector. The equation (5.4)
now becomes


dat da® 
| dt «3 dt dt dt 
In this we put 
dat dat da dat 
dt v dt and dt vy dt 
and obtain 
da? 
“y “dt dt dt 


Since the derivatives /2“/d/ may be chosen arbitrarily this gives 
i si i si ii ai i yi ji 
+ Dy, + Dye = 9, Dy, + Dy, 


7 « 
If we set) = y in this equation and sum with respect to y. we obtain 


Hence 


Hence, if the equations (5.1) and (2.1) are to determine the same system of
paths, the functions $\Gamma$ and $\Lambda$ must be related by (5.5) and (5.8).


Conversely, let $\Phi_\alpha$ be any covariant vector and let the tensor $\Phi^i_{\alpha\beta}$ be defined
by the equations (5.8). Then any two sets of differential equations (2.1) and
(5.1) will define the same system of paths, provided that (5.5) is satisfied.
For consider any path with respect to the $\Gamma$’s. Along this path we have


dak “dt dt (V3 ep) daf 
© dt dai dt dt 
dt dt 


Hence (5.4) is satisfied. But if (5.4) is added to (4.6) the corresponding
equations in $\Lambda$ are obtained. Hence every path with respect to the $\Gamma$'s is
also a path with respect to the $\Lambda$'s.

A system of functions $\Gamma^i_{\alpha\beta}$ determines a definition of infinitesimal parallelism
in the sense of Levi-Civita and Weyl. It is therefore appropriate to designate 
the body of theorems which state those properties which are determined by 
a particular set of differential equations (2.1) as an affine geometry of paths. 
In like manner the body of theorems which state properties of a system of 
paths independently of any particular definition of affine connection (i. e. of 
any particular set of differential equations (2.1)) may be called a projective 
geometry of paths. 

For example, the theory of the curvature tensor belongs to the affine
geometry of paths. For if the curvature tensor determined by (2.1) is denoted
by $B^i_{\alpha\beta\gamma}$ as in § 2, the corresponding curvature tensor determined by (5.1) is


In this expression $\Phi_{\alpha,\beta}$ denotes the covariant derivative (cf. § 10 below) of $\Phi_\alpha$
with respect to the functions $\Gamma^i_{\alpha\beta}$.
The Ricci tensor $R_{\alpha\beta}$ becomes


(5.10) Res +n ®,, 3 D 3 +in—1)®, D;, 
and the skew symmetric tensor $S_{\alpha\beta}$ becomes


(2.11) Seg — (+ 1)( 3— 




Comparing these three expressions, it is evident that a tensor which is the
same for the $\Lambda$'s as for the $\Gamma$'s is defined as follows:


This is what Weyl (loc. cit.) calls the projective curvature tensor, and its 
theory belongs to the projective geometry of paths. It can also be written in 
the form 


«py i Rey + R,,.) 
i 
(n Rex + Bie) + - (Rg — 
n?—1 n+] By ve” 


In the rest of this paper we shall be concerned entirely with the affine 
geometry of paths, to which we now return. 

6. Equations of the paths. A unique solution of (2.1) in the form (2.3)
can be found which satisfies a set of initial conditions

(6.1) $q^i = \psi^i(0)$,

(6.2) $\xi^i = \frac{d\psi^i(0)}{ds}$,

where $q^1, q^2, \ldots, q^n$ and $\xi^1, \xi^2, \ldots, \xi^n$ are arbitrary constants. For if we
differentiate (2.1) successively we obtain the following sequence of equations:


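(6.3)
$\frac{d^2x^i}{ds^2} + \Gamma^i_{\alpha\beta}\,\frac{dx^\alpha}{ds}\frac{dx^\beta}{ds} = 0,$
$\frac{d^3x^i}{ds^3} + \Gamma^i_{\alpha\beta\gamma}\,\frac{dx^\alpha}{ds}\frac{dx^\beta}{ds}\frac{dx^\gamma}{ds} = 0,$
$\frac{d^4x^i}{ds^4} + \Gamma^i_{\alpha\beta\gamma\delta}\,\frac{dx^\alpha}{ds}\frac{dx^\beta}{ds}\frac{dx^\gamma}{ds}\frac{dx^\delta}{ds} = 0,$
. . . . . . . . .

in which

(6.4) $\Gamma^i_{\alpha\beta\gamma} = \frac{1}{3}\,P\!\left(\frac{\partial\Gamma^i_{\alpha\beta}}{\partial x^\gamma} - \Gamma^i_{\delta\beta}\Gamma^\delta_{\alpha\gamma} - \Gamma^i_{\alpha\delta}\Gamma^\delta_{\beta\gamma}\right) = \frac{1}{3}\,P\!\left(\frac{\partial\Gamma^i_{\alpha\beta}}{\partial x^\gamma} - 2\,\Gamma^i_{\delta\alpha}\Gamma^\delta_{\beta\gamma}\right)$

and, in general,

(6.5) $\Gamma^i_{\alpha\beta\gamma\ldots\mu\nu} = \frac{1}{N}\,P\!\left(\frac{\partial\Gamma^i_{\alpha\beta\gamma\ldots\mu}}{\partial x^\nu} - \Gamma^i_{\delta\beta\gamma\ldots\mu}\Gamma^\delta_{\alpha\nu} - \cdots - \Gamma^i_{\alpha\beta\gamma\ldots\delta}\Gamma^\delta_{\mu\nu}\right),$

where $N$ denotes the number of subscripts, and the symbol $P$ denotes the sum
of the terms obtainable from the ones inside the parenthesis by permuting the
set of subscripts cyclically. Thus the functions $\Gamma^i_{\alpha\beta\gamma\ldots\nu}$ have the property of
being unchanged by any permutation of the subscripts.* The equations (6.1),
(6.2), (6.3) determine immediately the following series for $x^i$ in terms of $s$:

(6.6) $x^i = q^i + \xi^i s - \frac{1}{2!}\,\Gamma^i_{\alpha\beta}(q)\,\xi^\alpha\xi^\beta s^2 - \frac{1}{3!}\,\Gamma^i_{\alpha\beta\gamma}(q)\,\xi^\alpha\xi^\beta\xi^\gamma s^3 - \cdots.$

In this expression $\Gamma^i_{\alpha\beta\ldots\gamma}(q)$ represents the value of $\Gamma^i_{\alpha\beta\ldots\gamma}$ obtained by
giving $x$ the value $q$. In general we shall use $x$ to represent $(x^1, x^2, \ldots, x^n)$,
$\xi$ to represent $(\xi^1, \xi^2, \ldots, \xi^n)$, and so on. For any point $q$ and any “direction” $\xi$
we have a unique path determined by (6.6). These equations may be abbreviated
in the form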


(6.7) =x (q,&s). 


The jacobian of the equations (6.7) is equal to unity. Hence for values of 
x sufficiently near to q the equations can be solved, giving 


(6.8) gs = gi 


where $\Lambda^i$ is a multiple power series in $(x^i - q^i)$, beginning with second order
terms. Hence there is one and only one path joining $q$ to $x$.
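
The first step of the successive differentiation described above can be written out explicitly. The following is a sketch in modern notation, assuming the path equation has the form $d^2x^i/ds^2 + \Gamma^i_{\alpha\beta}\,x'^\alpha x'^\beta = 0$ with $\Gamma^i_{\alpha\beta}$ symmetric in its subscripts; the prime for $d/ds$ is a shorthand introduced here for illustration.

```latex
% Differentiate d^2x^i/ds^2 = -\Gamma^i_{\alpha\beta} x'^\alpha x'^\beta once more (x' = dx/ds):
\frac{d^3x^i}{ds^3}
  = -\frac{\partial\Gamma^i_{\alpha\beta}}{\partial x^\gamma}\,x'^\alpha x'^\beta x'^\gamma
    - 2\,\Gamma^i_{\delta\beta}\,\frac{d^2x^\delta}{ds^2}\,x'^\beta
  = -\left(\frac{\partial\Gamma^i_{\alpha\beta}}{\partial x^\gamma}
    - 2\,\Gamma^i_{\delta\beta}\Gamma^\delta_{\alpha\gamma}\right) x'^\alpha x'^\beta x'^\gamma .
% Only the symmetric part of the coefficient contributes to the cubic form.
% Averaging over the three cyclic permutations P of (\alpha,\beta,\gamma)
% yields a coefficient symmetric in all three subscripts:
\Gamma^i_{\alpha\beta\gamma}
  = \tfrac{1}{3}\,P\!\left(\frac{\partial\Gamma^i_{\alpha\beta}}{\partial x^\gamma}
    - 2\,\Gamma^i_{\delta\beta}\Gamma^\delta_{\alpha\gamma}\right) .
```

The cyclic sum is fully symmetric here because each of its two kinds of term is already symmetric in one pair of subscripts.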


* We have changed the notation used by Veblen in the Proceedings of the National 
Academy of Sciences, vol. 8 (1922), p. 192, in order to introduce this symmetry. 




7. Normal coördinates. Let us now put
(7.1) $y^i = \xi^i s$.


The equations (6.7) and (6.8) become 


(7.2) (gy) 
and 
(7.3) y = AM ( @). 


These equations may be regarded as defining a transformation from the
coördinates $(x^1, x^2, \ldots, x^n)$ to a new set of coördinates $(y^1, y^2, \ldots, y^n)$ which we
shall call normal coördinates because they reduce to Riemann's normal
coördinates in case the geometry of paths reduces to a Riemann geometry. This
transformation changes the differential equations of the paths (2.1) into


(7.4) $\frac{d^2y^i}{ds^2} + C^i_{\alpha\beta}\,\frac{dy^\alpha}{ds}\frac{dy^\beta}{ds} = 0,$

where $C^i_{\alpha\beta}$ are functions of $y$ defined by the equations


‘ 2 3 


(7.5) ce = a 
jk A 0 y! y* By a y 0 


These coördinates have been so chosen that the curves defined by (7.1) are
the paths through the origin. If we take any point $y$ there is one and only
one of the paths (7.1) which passes through it. Substituting (7.1) in (7.4)
we find

(7.6) $C^i_{\alpha\beta}\,\xi^\alpha\xi^\beta = 0,$

and hence on multiplying by the square of the value of $s$ determined for the
point $y$ by the equation (7.1) we obtain

(7.7) $C^i_{\alpha\beta}\,y^\alpha y^\beta = 0.$
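
The passage from the path equations in normal coördinates to the identity (7.7) is short enough to spell out. The lines below are a sketch in modern notation; the index placement on $C$ is assumed from the surrounding text.

```latex
% Along a path through the origin, y^i = \xi^i s with \xi^i constant, so
\frac{dy^i}{ds} = \xi^i , \qquad \frac{d^2y^i}{ds^2} = 0 .
% Substituting in the path equation written in normal coordinates,
\frac{d^2y^i}{ds^2} + C^i_{\alpha\beta}\,\frac{dy^\alpha}{ds}\frac{dy^\beta}{ds} = 0
\;\Longrightarrow\; C^i_{\alpha\beta}\,\xi^\alpha\xi^\beta = 0 .
% Multiplying by s^2 and using y^\alpha = \xi^\alpha s:
C^i_{\alpha\beta}\,(\xi^\alpha s)(\xi^\beta s) = C^i_{\alpha\beta}\,y^\alpha y^\beta = 0 .
```

Here $C^i_{\alpha\beta}$ is evaluated at the point $y = \xi s$ of the path, so the final identity holds at every point of the normal coördinate system.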


Let us now consider the effect of a transformation of the variables $x$ of
the form (3.1). This changes the equation of a path (2.3) which satisfies (2.1)
into the equation (3.3) which satisfies (3.6). It also changes the initial
conditions (6.1) and (6.2) into

(7.8) $\bar q^i = \bar\psi^i(0) = f^i(q)$

and

(7.9) $\bar\xi^i = \frac{d\bar\psi^i(0)}{ds} = \left(\frac{\partial\bar x^i}{\partial x^\alpha}\right)_q \xi^\alpha$

respectively, the subscript $q$ indicating that the derivative is evaluated for
$x = q$. For any point $p$, not too far away from $q$, there is a unique path
and thus a unique set of values $(y^1, y^2, \ldots, y^n)$. From these we determine
$\xi$ and $s$ so that $y^i = \xi^i s$. Then (3.3) gives the equation of the same path in
terms of the coördinates $\bar x$ in such form that the point $p$ is determined by the
parameter $s$. Hence by (7.9),

(7.10) $\bar y^i = \left(\frac{\partial\bar x^i}{\partial x^\alpha}\right)_q y^\alpha.$


In this formula the coefficients $(\partial\bar x^i/\partial x^\alpha)_q$ are independent of the particular path
and dependent only on the point $q$ and the two coördinate systems. Hence
when the coördinates $x$ undergo an arbitrary analytic transformation, the
normal coördinates determined by the coördinates $x$ and a point $q$ suffer a linear
homogeneous transformation (7.10) with constant coefficients. In other words
the normal coördinates are transformed like contravariant vectors. They are
not vectors, however, in the narrow sense, but are the components of a “step”
from the origin of the normal coördinates to the point at which the coördinates
are taken. An arbitrary step $(AB)$ determined by the points $A$ and $B$ can be
represented by the coördinates of the point $B$ in the normal coördinate system
associated with the point $A$.

8. Alternative treatment of normal coördinates. The identity (7.7)
can be used as the definition of the normal coördinates. For by (3.12)


(8.2) 


da) Oak! By” ay? 


so that (7.7) becomes 


The differential equations (8.2) uniquely determine a functional relation
between the $x$'s and the $y$'s when taken in conjunction with the initial
conditions

(8.3) $y^i = 0$ when $x^i = q^i$,

(8.4) $\frac{\partial y^i}{\partial x^j} = \delta^i_j$ when $x^i = q^i$.

For when we differentiate (8.2) repeatedly and substitute these initial
conditions, making use of the formulas at the end of § 3, we find


(8.5) 
and 
i i i « p 
(8.6) 


where the $\Gamma$'s have the meaning given them in § 3 and the $\Lambda$'s are such that
vt 
= Vix, 


klm 


If the general solution of (8.2), regarded as a differential equation for $y$ in
terms of $x$, is denoted by $\bar y$ when only the initial conditions (8.3) are imposed,
then

(8.7) $\bar y^i = a^i_j\,y^j,$


| 
3 | xt — Tit) , 




where the $a^i_j$ are arbitrary constants, and $y$ is the solution determined by the
initial conditions (8.3) and (8.4) which is given by (8.6). This last theorem is
proved by observing, first, that the function $\bar y$ defined by (8.7) satisfies (8.2)
and (8.3) and, second, that if there were any other solution for which

$\frac{\partial\bar y^i}{\partial x^j} = a^i_j$

when $x^i = q^i$, the solution (8.6) would not be uniquely determined by (8.4).

In order to show the tensor character of the normal coördinates let us now
consider the effect of a transformation of the variables $x$ of the form (3.1).
We inquire what are the normal coördinates determined by $\bar x^1, \bar x^2, \ldots, \bar x^n$.
These normal coördinates which we shall denote by $\bar y^1, \bar y^2, \ldots, \bar y^n$ are
solutions of


ik ‘ 
J 02a ay” ay? 


If we substitute into this the value of $\bar\Gamma^i_{jk}$ from (3.7) we obtain


(2 Ox? . at Ox? ay! dad Oak 
Ors Ox" Gat] On! Oa" ay” ay? 
or 
Ox ay" Oy! 
(8.9) 


The parenthesis in the second term is identically zero. Hence (8.9) is the 
same differential equation as (8.2). 
By definition the normal coördinates must satisfy the initial conditions


= 0 when = 
and 
ay? 


i 
= 0 and = 4; 
i 42 




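Hence we must have

$\frac{\partial\bar y^i}{\partial x^j} = \frac{\partial\bar x^i}{\partial x^j}.$

The value of $\partial\bar x^i/\partial x^j$ when $x^i = q^i$ is determined by (3.1). Let us call
it $a^i_j$. Then by the theorem regarding the formula (8.7),

(8.10) $\bar y^i = a^i_j\,y^j$

is a solution of (8.9) determined by the conditions

$\bar y^i = 0$ and $\frac{\partial\bar y^i}{\partial x^j} = a^i_j$ when $x^i = q^i$.

Hence when the coördinates $x$ undergo an arbitrary transformation (3.1) the
normal coördinates undergo a linear transformation (8.10) the coefficients of
which are given by

(8.11) $a^i_j = \left(\frac{\partial\bar x^i}{\partial x^j}\right)_q.$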
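9. The normal tensors. Since $C^i_{jk}$ is symmetric in $j$ and $k$ and $\xi$ is
entirely arbitrary it follows from (7.6) that $C^i_{jk}$ vanishes at the origin of
normal coördinates, i.e.,

(9.1) $(C^i_{jk})_0 = 0.$

Hence the power series for $C^i_{jk}$ takes the form

(9.2) $C^i_{jk} = A^i_{jk\alpha}\,y^\alpha + \frac{1}{2!}\,A^i_{jk\alpha\beta}\,y^\alpha y^\beta + \cdots,$

in which the $A$'s are the derivatives of $C^i_{jk}$ evaluated at the origin, i.e.,

(9.3) $A^i_{jk\alpha\beta\ldots\tau} = \left(\frac{\partial}{\partial y^\alpha}\frac{\partial}{\partial y^\beta}\cdots\frac{\partial}{\partial y^\tau}\,C^i_{jk}\right)_0.$

The equation (9.3) can be taken as defining $A^i_{jk\alpha\ldots\tau}$ as a set of functions of
$(x^1, x^2, \ldots, x^n)$. At any point $(p^1, p^2, \ldots, p^n)$, $A^i_{jk\alpha\ldots\tau}$ is equal to the right
hand member of (9.3) evaluated in the system of normal coördinates having
$(p^1, p^2, \ldots, p^n)$ as origin. The functions so defined are tensors. For consider
a transformation from $x$ to $\bar x$ and the transformation which it produces from
$y$ to $\bar y$. By (3.7) we have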


‘ 3 
c OF i oO 
(9.4) 


ay" by) 
and trom (9.2) 
i Oy” i oy oy i oy of 
Hence 
ay? dy" dy? 
21° Pres dy’ 
Comparing this with the equation 
we have 
If we make the substitution 
bat 
(9.5) 4 
Oy! Or 
then 


where $\bar A$ is regarded as a function of $(\bar x^1, \bar x^2, \ldots, \bar x^n)$ and $A$ as a function of
$(x^1, x^2, \ldots, x^n)$. This shows that $A^i_{jkl\ldots m}$ is a tensor which is contravariant
in $i$ and covariant in $jkl\cdots m$. We shall call it a normal tensor because of
its definition in terms of normal coördinates.




By their definition (cf. (9.3)) these tensors are symmetric in the first two
subscripts and also in the remaining ones, i. e.,

(9.7) $A^i_{jk\alpha\beta\ldots\tau} = A^i_{kj\alpha\beta\ldots\tau},$

(9.8) $A^i_{jk\alpha\beta\ldots\tau} = A^i_{jk\gamma\delta\ldots\sigma},$

where $\gamma\delta\ldots\sigma$ is intended to represent any permutation of $\alpha\beta\ldots\tau$.

If we multiply (9.2) by $y^j y^k$ and sum, the left member is zero by (7.7) and
the right member is a multiple power series the coefficient of each term of
which must be zero. It therefore follows that

(9.9) $A^i_{jkl} + A^i_{klj} + A^i_{ljk} = 0,$
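
The lowest-order case of this argument can be made explicit. The sketch below assumes the series for $C^i_{jk}$ begins with the linear term $A^i_{jk\alpha}y^\alpha$ and that $A^i_{jkl}$ is symmetric in $j$ and $k$; the index names are chosen here for illustration.

```latex
% Multiply C^i_{jk} = A^i_{jk\alpha} y^\alpha + \cdots by y^j y^k. The left side
% vanishes identically because C^i_{jk} y^j y^k = 0, so the cubic term gives
A^i_{jk\alpha}\,y^j y^k y^\alpha = 0 \quad \text{for all } y .
% Only the totally symmetric part of the coefficient survives in a cubic form, hence
A^i_{jkl} + A^i_{jlk} + A^i_{kjl} + A^i_{klj} + A^i_{ljk} + A^i_{lkj} = 0 .
% By the symmetry in the first two subscripts the six terms coincide in pairs:
2\left(A^i_{jkl} + A^i_{klj} + A^i_{ljk}\right) = 0 .
```

The higher identities arise in the same way from the coefficients of the quartic and later terms.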


i i i i i i 
and in general 


i 


(9.11) = 


where $S(\ )$ stands for the sum of the $N(N-1)/2$ terms obtainable from
the one in the parenthesis and not identical because of (9.7) and (9.8).

The tensors $A$ are expressible in terms of the functions $\Gamma$ and their
derivatives. If we differentiate (7.5) we obtain


oy “ay dy dy 
(9.12) 


Substituting the values of the partial derivatives of $x$ with regard to the $y$'s
as computed from (6.6) or (8.5) for the origin of normal coördinates, we find


dat 




If we differentiate (9.12) again we obtain 


a2 
44 jklm 0 gl jkim y! lem dat jm kl 
1 


It is evident that a continuation of this process will determine the explicit 
formulas for any number of the A’s. 

10. Covariant differentiation. Covariant differentiation is a process by
which from a given tensor there may be formed a new tensor with one more
covariant index. Let $T^{lm\ldots n}_{ij\ldots k}$ be any tensor referred to arbitrary coördinates
$(x^1, x^2, \ldots, x^n)$ which is contravariant in $(l, m, \ldots, n)$ and covariant in
$(i, j, \ldots, k)$. Let $t^{lm\ldots n}_{ij\ldots k}$ be the components of $T^{lm\ldots n}_{ij\ldots k}$ in the normal
coördinate system $(y^1, y^2, \ldots, y^n)$ which is determined by the $x$-coördinate system
at the point $(q^1, q^2, \ldots, q^n)$. The equation

(10.1) $T^{lm\ldots n}_{ij\ldots k,p} = \left(\frac{\partial t^{lm\ldots n}_{ij\ldots k}}{\partial y^p}\right)_0$

defines a set of functions of $x$ which turn out to be the components of a tensor.
In the Riemann geometry this tensor is the same as the covariant derivative 
of T according to the definition of Ricci and Levi-Civita. Hence we shall 
call it by the same name in the general case. The subscripts arising by 
covariant differentiation will be separated from those originally present in the 
tensor by a comma. 

Let us now prove that the functions $T^{lm\ldots n}_{ij\ldots k,p}$ actually are the components
of a tensor. Let the functions $T$ and $t$ become $\bar T$ and $\bar t$ respectively under the
arbitrary transformation (3.1). This gives the equations


j...k = ay? ay” ay? 


40 


1 4p 
|_| 




Since the derivatives in (10.3) are constants we obtain by partial differentiation 


Oleg... dy" oy" ay? dy? By” 
dy? ay” dy? ay ay” ay? oy dy" dy? 
and, hence, at the point $q$ we have
(10.5) = OB...7,0 x8 Ox dx Ox? 


Since the point $q$ is arbitrary, $T^{lm\ldots n}_{ij\ldots k,p}$ is a tensor which is contravariant
in $(l, m, \ldots, n)$ and covariant in $(i, j, \ldots, k, p)$.

Let us next evaluate $T^{lm\ldots n}_{ij\ldots k,p}$ in terms of the $\Gamma$'s and the original
tensor $T^{lm\ldots n}_{ij\ldots k}$. To do this we differentiate the equations


> lm...” ¥ dy! oy” oy" 
obtaining 
dx” Oy? ax” oy dy 
Ox Oy oy" oy 
At the origin of normal coördinates
(10.8) - = = jj, —— 


as follows directly from (8.5) and its inverse (8.6). The substitution of (10.8) 
into (10.7) then yields 


lm...n 

m 
(10.9) 


im... 
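
For the simplest case, a covariant vector $T_i$, the computation just described can be sketched as follows. This is an illustration in modern notation and not a transcription of the general formula; it assumes the series $x^\alpha = q^\alpha + y^\alpha - \tfrac12\Gamma^\alpha_{\beta\gamma}y^\beta y^\gamma - \cdots$ connecting the two coördinate systems.

```latex
% Components of the vector in normal coordinates:
t_i = T_\alpha\,\frac{\partial x^\alpha}{\partial y^i} .
% Differentiating with respect to y^p:
\frac{\partial t_i}{\partial y^p}
  = \frac{\partial T_\alpha}{\partial x^\beta}\,\frac{\partial x^\beta}{\partial y^p}\,\frac{\partial x^\alpha}{\partial y^i}
  + T_\alpha\,\frac{\partial^2 x^\alpha}{\partial y^i\,\partial y^p} .
% At the origin the assumed series gives
\left(\frac{\partial x^\alpha}{\partial y^i}\right)_0 = \delta^\alpha_i , \qquad
\left(\frac{\partial^2 x^\alpha}{\partial y^i\,\partial y^p}\right)_0 = -\Gamma^\alpha_{ip} ,
% so that the covariant derivative of the vector is
T_{i,p} = \frac{\partial T_i}{\partial x^p} - \Gamma^\alpha_{ip}\,T_\alpha .
```

Each additional covariant index contributes one more term of the same kind, and each contravariant index a corresponding term with the opposite sign.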


By using the tensors 
aab...n oh n a n sa gb n 
(10.10) Derg = 6, 6, 05 +0) +45 0, 0,---d, +--- +0, 05 02 
and 
(10.11) = 9a ++ On +04 0a +0, da 
the formula (10.9) may be written in the form 
9 Tim n y ol 
(10.12) 
pim...n 


The covariant derivatives of the sum and of the product of two tensors 
with the same number of covariant and contravariant indices are formed by the 


same rules as hold in the differential calculus. That is, if 


mim... In l 
(10.13) => ke + 
then 
rpm... lm... | Im... 
(10.14) = + By. ip, 
and if 
rim... 
then 


These formulas follow without difficulty from (10.1). 


11. A generalization of covariant differentiation. By repeated 


differentiation of (10.3) we obtain 


dy?... dy! ay7... dy? 


oy” dy" oy? dy 
ay oy? by ay 


(11.1) 
2 
“ee ip 7? ay? . 
40* 


This shows that a set of tensors $T^{lm\ldots n}_{ij\ldots k,p\ldots q}$ are defined by

(11.2) $T^{lm\ldots n}_{ij\ldots k,p\ldots q} = \left(\frac{\partial^r t^{lm\ldots n}_{ij\ldots k}}{\partial y^p\cdots\partial y^q}\right)_0 \qquad (r = 1, 2, 3, \ldots),$

where the derivatives on the right are evaluated at the origin of normal
coördinates. For $r = 1$ this is the ordinary covariant derivative
that we have just considered. The tensors $T^{lm\ldots n}_{ij\ldots k,p\ldots q}$ form a group of tensors
that may be derived from a given tensor. We shall refer to the general
tensor of this group, namely, $T^{lm\ldots n}_{ij\ldots k,p\ldots q}$, as the $r$th extension of $T^{lm\ldots n}_{ij\ldots k}$,
$r$ being the number of indices $p, \ldots, q$. By its definition this tensor is
symmetric with respect to the indices $p, \ldots, q$. The operation of forming the
extension of a tensor may be repeated any number of times. For example,
$T_{ij,pq,r,stu}$ is the third extension of the first extension of the second extension
of $T_{ij}$.

The rth extension of the sum of two tensors which are of the same order 
in their covariant and contravariant indices is equal to the sum of the rth 
extensions of the two tensors, i. e., 


9) 


This follows directly from the character of the tensor transformation. The
formula for the covariant derivative (first extension) of the product of two
tensors does not apply, however, for the case of the $r$th extension ($r > 1$).
For let the tensor $T$ be equal to the product of two tensors as in the
equation (10.15). If $T$, $A$, $B$ become $t$, $a$, $b$ in a normal coördinate system
$(y^1, y^2, \ldots, y^n)$ we have in this system

Im.. Pere 


The formula for the $r$th extension of $T$ is obtained by carrying out the
differentiation indicated in the following equation:


at 


This formula has $2^r$ terms.
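
As an illustration of the count, the case $r = 2$ can be written out. The sketch below suppresses the tensor indices on the two factors, which is a simplification made here for readability.

```latex
% In normal coordinates t = a\,b, so differentiating twice:
\frac{\partial^2 t}{\partial y^p\,\partial y^q}
  = \frac{\partial^2 a}{\partial y^p\,\partial y^q}\,b
  + \frac{\partial a}{\partial y^p}\,\frac{\partial b}{\partial y^q}
  + \frac{\partial a}{\partial y^q}\,\frac{\partial b}{\partial y^p}
  + a\,\frac{\partial^2 b}{\partial y^p\,\partial y^q} .
% Evaluating at the origin gives 2^2 = 4 terms:
T_{,pq} = A_{,pq}\,B + A_{,p}\,B_{,q} + A_{,q}\,B_{,p} + A\,B_{,pq} .
```

The two middle terms are what distinguish the second extension of a product from the rule that holds for the first extension.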
im. 


Any tensor $T^{lm\ldots n}_{ij\ldots k,p\ldots q}$ may be expressed in terms of the $\Gamma$'s and the
original tensor $T^{lm\ldots n}_{ij\ldots k}$ by the same process that we have used for the case
of the tensor $T^{lm\ldots n}_{ij\ldots k,p}$. That is we have to take the successive partial
derivatives of both members of (10.6) and substitute in these equations the
equations (10.8) and


(11.6) — (Tao, = (Aino, 


and so on. It is evident that formulas for extensions of all kinds can be
obtained by this process. Instead of giving the general formulas, however, we
shall set down the first extensions of the first four kinds for the covariant
vector and the covariant tensor of the second order. In these formulas $S$ is
used to indicate the sum of all distinct terms which can be formed from the one
in the parenthesis by replacing the given combination of the subscripts $p, q$
or $p, q, r$ or $p, q, r, s$ by arbitrary combinations of these subscripts. Thus


a2 7. a2 q2 7". q2 7. 


Ox? Ox! Ox" Ox! 
(11.9) 
+ Vip Tar) — 8 Vege) — — Ta Tiger 
Oz } \ O 
Ti. pare = 8 Is) — 8 
Ox? OX" OxP Ox! OX? OX’ 
a? 
+ 8{- — Us) +8 Us) — 8 (- 
Ox" ‘Ox? da! 
(11.10) 
42 a7 \ ar \ 
— + 8 Vins) + 8 Ving 
0 xt Oa! Oat 
ry’ 
0 T; m yet 


U a 


(11.11) = ap Vip — Vie Vip 


6 
4 
(11.12) 


‘pq Jpq? 


Tice 


Tj, py 


r Pp re) 
p q JT q 


Tia 
(11.13) +8 


par ap ip” jq « ipq 


— Te Fipgr — Dice Vipar 


a3 7 as 
T;; Ti; 0 re 0 1 ie r) 
a3 TT, a2 T 43 T. 


a2 7 7, 
a2 a2 yy. an. 
S| Tij_ ré ré\— | re (= Taj re 


a ip 


+ Tic re | +g r eri oTy re 
Jp qrs| Bat pars 


— Te; 1 S8( Tes re.) 


+ Tus SUE — T,, 


Jjqrs ia jpqrs* 




12. Formulas for repeated covariant differentiation. In this section 
we write down a few special formulas relating the tensors obtained by 
successive covariant differentiation to the higher extensions and the normal 
tensors. In each case the formula is obtained by computing the covariant 
derivative in question according to the formulas of §10 and evaluating at the 
origin of normal coördinates:


(12.2) 


i,p,q,7,8 i, pars ~ irs a,pr igs a, ps 


a,qr ~~ ips a, qs ipr ~~ ipq 


~ prs i,ar “pqs i,a@s a,p~ igrs 
(12.3) 
—T AP — AC — Ae AP 
Bpr igs Bps iqr Bir ~ pqs 
— « a 
(12.4) Ts, Aing — Aj ‘ipa? 
« 7 Aa 
per Ti, AL, — Ly, q — Aing 
(12.5) Tice, p Aj jar r iy 
ij, pars Tj Aipars T; « 


1a, Jprs Jpqs 1a,3 Jpqr 




* igrs ia, p pars 


«j,qr *-ips «j,qs ~~ ipr «j,rs ipy 


Al 
1 i@,rs 


— At 


ia, qr *"jps i, qs 


~ irs “j,p,r igs Gj, iqr 


ia,p.q jvs ie, p,r ia, p,s jqr 


« « 
ij,@,q prs 


— ‘yy 
ij,p,@ *"qrs 


(12.7) 


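13. A generalization of the normal tensors. If we transform the
equations (6.3) to a system of normal coördinates and make use of (7.1) we
obtain the following sequence:

(13.1)
$C^i_{\alpha\beta}\,\xi^\alpha\xi^\beta = 0,$
$C^i_{\alpha\beta\gamma}\,\xi^\alpha\xi^\beta\xi^\gamma = 0,$
. . . . . . . . .
$C^i_{\alpha\beta\gamma\ldots\sigma}\,\xi^\alpha\xi^\beta\xi^\gamma\cdots\xi^\sigma = 0,$

where the $C$'s denote the corresponding functions $\Gamma$ in normal coördinates.
$C^i_{\alpha\beta\gamma\ldots\sigma}$ is symmetric in the indices $\alpha\beta\gamma\cdots\sigma$. The functions $C^i_{\alpha\beta}$ are related
to the $\Gamma$'s by (7.5) and there are similar equations of transformation for the
other $C$'s. Thus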


(13.2) 


Since the $\xi$'s are entirely arbitrary at the origin of normal coördinates it
follows from (13.1) that

(13.3) $(C^i_{\alpha\beta\gamma\ldots\sigma})_0 = 0,$

where the left member denotes the value of $C$ at the origin of the normal
coördinate system.

We may define a set of functions $A^i_{(\alpha\beta\gamma\ldots\sigma)p\ldots q}$ of $(x^1, x^2, \ldots, x^n)$
corresponding to the normal tensor $A^i_{\alpha\beta p\ldots q}$ defined by (9.3) by the equations

$A^i_{(\alpha\beta\gamma\ldots\sigma)p\ldots q} = \left(\frac{\partial}{\partial y^p}\cdots\frac{\partial}{\partial y^q}\,C^i_{\alpha\beta\gamma\ldots\sigma}\right)_0,$

in which the derivative on the right is evaluated at the origin of normal
coördinates. By a method similar to that employed in § 9 we can show that
$A^i_{(\alpha\beta\gamma\ldots\sigma)p\ldots q}$ possesses a tensor character, but this fact may also be inferred
by observing that $A^i_{(\alpha\beta\gamma\ldots\sigma)p\ldots q}$ is expressible in terms of the normal tensors.
The tensors $A^i_{(\alpha\beta\gamma\ldots\sigma)p\ldots q}$ thus constitute a generalization of the normal tensors.
They are symmetric in the indices $\alpha\beta\gamma\cdots\sigma$ and $p\cdots q$. In case $A^i_{(\alpha\beta\gamma\ldots\sigma)p\ldots q}$
contains only two terms in the parenthesis it is a normal tensor and we shall
then omit the parenthesis for simplicity.

The following equations express a few particular cases of the relations
between the tensors $A^i_{(\alpha\beta\gamma\ldots\sigma)p\ldots q}$ and the normal tensors. These equations
are obtained by differentiating the identities (6.5) referred to normal
coördinates and evaluating at the origin. The symbols $P$ and $S$ have their previous
significance (cf. § 6 and § 11) except that $P$ operates only on the letters $\alpha\,\beta\,\gamma\,\delta\,\varepsilon$
and $S$ only on the letters $p$, $q$, $r$:


(13.4) 


apy...) p. 


(13.5) = P 


(13.6) A’ 


“By) pq 


(13.7) por 
(13.8) p 


(13.9) A 


( 13.10) 


apyd 




1 
Pl ae 
= (Aver — 28 (Aven 
i 
4 P dps 
[ai — 38 (4 1; 
4 “Aapy)dpq (2 vep)p ydq! 


=P { p): 


By differentiating the equations of the type (13.2) and evaluating at
the origin of normal coördinates we may express these tensors in terms of
the functions $\Gamma$.


i 
«py)p 


(13.11) 


For example 


apy 


i 
+ Prue Apyy + up 4 


ri. 


The generalized normal tensors appear in some of the formulas of 
extension which generalize the formulas of § 12. We here write down 
only the following four particular cases: 


2 1¢ __ om 
(13.12) Tip, Ti, pgr — le r< le Aipgr 


(13.13) 


— 
qr 


(13.14) 


ij, Pd, 


(13.15) 


Ti, par Ti, A — Te, pAigr— Aipr — Te Ating) r3 


a 
Tij, Taj, q+ — ri Ling — Tic Ajpr — Tie,r Ajng 


a 
Taj ia Ajngr; 


a « 
= Ty, pqr— Apgr— Taj,p Aigr— Toj,q Aipr— Tice,» Ajar 


ia,q Ajpr— Taj Atipg)r— Tice A Jpg re 


The generalized normal tensors satisfy the identity

(13.16) $S\left(A^i_{(\alpha\beta\gamma\ldots\sigma)p\ldots q}\right) = 0,$

where $S(\ )$ denotes the sum of the terms obtainable from the one in the
parenthesis which are not identical because of the symmetric properties
of $A^i_{(\alpha\beta\gamma\ldots\sigma)p\ldots q}$. This identity may easily be proved by the method used for
the corresponding theorem about the normal tensors in § 9.

14. The curvature tensor. The normal tensor $A^i_{jkl}$ is related to the
curvature tensor by the equation

(14.1) $B^i_{jkl} = A^i_{jkl} - A^i_{jlk},$

which is immediately evident on comparing (2.7) with (9.13). The tensor
character of $B$ follows from that of $A$. From the definition it follows that

(14.2) $B^i_{jkl} = -B^i_{jlk}.$

From (9.9) it follows that

(14.3) $B^i_{jkl} + B^i_{klj} + B^i_{ljk} = 0.$

Also by solving the equations (14.1) and (9.9) for the $A$'s we obtain

(14.4) $A^i_{jkl} = \tfrac{1}{3}\,(2B^i_{jkl} + B^i_{ljk})$

or

(14.5) $A^i_{jkl} = \tfrac{1}{3}\,(B^i_{jkl} + B^i_{kjl}).$
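
The solution for the $A$'s can be checked directly. The sketch below assumes the relation $B^i_{jkl} = A^i_{jkl} - A^i_{jlk}$, the symmetry of $A^i_{jkl}$ in $j$ and $k$, and the cyclic identity $A^i_{jkl} + A^i_{klj} + A^i_{ljk} = 0$, with index placement reconstructed from the surrounding text.

```latex
% Expand the right-hand combination using the definition of B:
B^i_{jkl} + B^i_{kjl}
  = \left(A^i_{jkl} - A^i_{jlk}\right) + \left(A^i_{kjl} - A^i_{klj}\right) .
% Use the symmetry in the first two subscripts: A^i_{kjl} = A^i_{jkl}, A^i_{jlk} = A^i_{ljk}.
  = 2A^i_{jkl} - \left(A^i_{ljk} + A^i_{klj}\right) .
% By the cyclic identity, A^i_{ljk} + A^i_{klj} = -A^i_{jkl}, hence
  = 3A^i_{jkl} .
```

The alternative form follows from this one because $B^i_{kjl} = -B^i_{klj} = B^i_{jkl} + B^i_{ljk}$.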


If we write (9.13) in normal coördinates, differentiate, and evaluate at the
origin we obtain


(14.6) Aina, — — Abjnaym 
From this and (14.1) it follows that 


i i i 
(14.7) Byrt,m = Ajkim— Ajtem. 


The equations (14.7) and (9.10) may be solved for the A’s giving 
(14.8) = 6 (5 Bijt,m + 4 Burj, m +3 Bijm, 2 + Brj,t)- 

If in (14.7) we permute the indices $k$, $l$, $m$ cyclically and add the three
resulting equations we obtain the identity of Bianchi,

(14.9) $B^i_{jkl,m} + B^i_{jlm,k} + B^i_{jmk,l} = 0.$
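
The cancellation behind this identity can be displayed term by term. The sketch assumes the relation $B^i_{jkl,m} = A^i_{jklm} - A^i_{jlkm}$ and the symmetry of $A^i_{jklm}$ in its last two subscripts.

```latex
% Permute k, l, m cyclically in B^i_{jkl,m} = A^i_{jklm} - A^i_{jlkm} and add:
B^i_{jkl,m} + B^i_{jlm,k} + B^i_{jmk,l}
  = \left(A^i_{jklm} - A^i_{jlkm}\right)
  + \left(A^i_{jlmk} - A^i_{jmlk}\right)
  + \left(A^i_{jmkl} - A^i_{jkml}\right) .
% Regroup the terms that differ only in the order of the last two subscripts:
  = \left(A^i_{jklm} - A^i_{jkml}\right)
  + \left(A^i_{jlmk} - A^i_{jlkm}\right)
  + \left(A^i_{jmkl} - A^i_{jmlk}\right) = 0 .
```

Each bracket vanishes separately, so no use of the cyclic identities for the $A$'s is needed at this step.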
From (14.7) there also follows the identity, 
From (12.7) there follows the important identity of Ricci and Levi-Civita, 
Im.. Im. me 
Ty. — — ij. — < + Ti ‘pq 
(14.11) 
Using the tensors D and E (14.11) becomes 

This identity may be generalized by combining identities of the type (13.12). 
For example, 
(14.13) Ti, p, qr Ti, q.m Te, p Ai — -T. Bing Te 


15. Homogeneous first integrals. A homogeneous first integral of the
$k$th degree of the differential equations (2.1) is an equation of the form

(15.1) $a_{\alpha\beta\ldots\gamma}\,\frac{dx^\alpha}{ds}\frac{dx^\beta}{ds}\cdots\frac{dx^\gamma}{ds} = \text{constant}$

which holds along every path. From the equations for the transformation of
$dx^i/ds$ it follows that the functions $a_{\alpha\beta\ldots\gamma}$ are the components of a covariant
tensor. We shall now derive some general theorems about the conditions
under which a tensor $a_{\alpha\beta\ldots\gamma}$ gives rise to a first integral. The first of these
theorems is given by Ricci and Levi-Civita for the case of the Riemann
geometry in Chapter 5 of their Méthodes de calcul différentiel absolu.

If we differentiate (15.1) with respect to $s$ we have


ds ds ds ds ds ds 


At the origin of the normal coördinates this equation becomes


where $a_{\alpha\beta\ldots\gamma,\delta}$ is the covariant derivative of $a_{\alpha\beta\ldots\gamma}$. The substitution involved
in obtaining the last equation is permissible on account of (10.1) which holds
at the origin of the normal coördinates. The identity


(15.3) = 0 


where $P$ indicates the sum of the terms obtained from the one inside the
parenthesis by cyclic permutation of the subscripts, is therefore a necessary
condition for the existence of the integral (15.1). Owing to its tensor
character (15.3) has validity in all coördinate systems.

To show that (15.3) is also sufficient for the existence of the first integral,
let this equation be satisfied by the symmetric tensor $a_{ij\ldots k}$, and express it in
its expanded form


Vit tej... dia... — di...) = 9. 
] 


If we consider this equation referred to a normal coördinate system and
multiply by $(dy^i/ds)(dy^j/ds)\cdots(dy^k/ds)(dy^l/ds)$, we obtain


diy dy? dy’ 


(15.4) ds ds ds ds 


da dx? dat da? dat 4 wi 
Oleg... <P 0 

owing to the equation (7.1). We may also write 




on making use of the equation (7.6). Since the derivatives $dy^i/ds$ in (15.4)
are constant along any particular path, it follows that


df dy 


along any particular path. In consequence of the tensor character of $a_{ij\ldots k}$,
we have in general coördinates


dz* dat 


a constant. 
ap...7 ds ds ds 


Hence, A necessary and sufficient condition for the existence of a homogeneous
first integral of the $k$th degree is that a symmetric covariant tensor of the $k$th
order $a_{ij\ldots k}$ exist which satisfies (15.3).

If a symmetric tensor $b_{ij\ldots k}$ and function $\varphi(x^1, x^2, \ldots, x^n)$ exist which
satisfy the equations


(15.5) P(by. = Plby. = 


where $b_{ij\ldots k,l}$ is the covariant derivative of $b_{ij\ldots k}$, a function $\psi$ can be chosen
so that the equation


(15.6) == 


is satisfied. The bracket contains the covariant derivative of $\psi\,b_{ij\ldots k}$ with
respect to $x^l$. In fact, we have


P{( Whi. 


dlogw 


Hence (15.6) is satisfied if we put

$\psi = e^{-\varphi}.$

Therefore if (15.5) is satisfied a first integral exists which is given by $e^{-\varphi}\,b_{ij\ldots k}$.
That (15.5) is a necessary condition is immediate, for, as we have seen, if $b_{ij\ldots k}$
furnishes a first integral (15.5) is satisfied with $\varphi$ = constant.
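
The choice of the multiplier can be traced in a few lines. The sketch below assumes that (15.5) has the form $P(b_{ij\ldots k,l}) = P(b_{ij\ldots k}\,\partial\varphi/\partial x^l)$; that form, and the letters $\varphi$ and $\psi$, are reconstructed from the surrounding text rather than read from the equation itself.

```latex
% Product rule for the covariant derivative of \psi b, with \psi a scalar function:
(\psi\,b_{ij\ldots k})_{,l}
  = \psi\,b_{ij\ldots k,l} + b_{ij\ldots k}\,\frac{\partial\psi}{\partial x^l}
  = \psi\left(b_{ij\ldots k,l} + b_{ij\ldots k}\,\frac{\partial\log\psi}{\partial x^l}\right) .
% Sum over cyclic permutations and use the assumed form of (15.5):
P\!\left[(\psi\,b_{ij\ldots k})_{,l}\right]
  = \psi\,P\!\left[b_{ij\ldots k}\left(\frac{\partial\varphi}{\partial x^l}
    + \frac{\partial\log\psi}{\partial x^l}\right)\right] .
% The right side vanishes when \log\psi = -\varphi, that is, when \psi = e^{-\varphi}.
```

Under the same assumption, a constant $\varphi$ reduces the condition to the vanishing of the cyclic sum of covariant derivatives of the tensor itself.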

Hence, A necessary and sufficient condition for the existence of a covariant
tensor $a_{ij\ldots k}$ which satisfies (15.3) is that a covariant tensor $b_{ij\ldots k}$ and function $\varphi$
exist which satisfy (15.5). If the tensor $b_{ij\ldots k}$ and function $\varphi$ exist, then


A particular case of (15.3) is 
(15.7) $a_{ij\ldots k,l} = 0,$


where $a_{ij\ldots k,l}$ is the covariant derivative of $a_{ij\ldots k}$. In a manner similar to the
above it can then be shown that


is a necessary and sufficient condition for the existence of a first integral
which satisfies (15.7), and that this integral is given by $a_{ij\ldots k} = e^{-\varphi}\,b_{ij\ldots k}$.

Hence, A necessary and sufficient condition for the existence of a covariant
tensor $a_{ij\ldots k}$ which satisfies (15.7) is that a covariant tensor $b_{ij\ldots k}$ and function $\varphi$
exist which satisfy (15.8). If the tensor $b_{ij\ldots k}$ and function $\varphi$ exist then


$a_{ij\ldots k} = e^{-\varphi}\,b_{ij\ldots k}.$


The equation (14.12) provides a new statement of this last theorem. If the
tensor $b_{ij\ldots k}$ satisfies (15.8) we obtain by covariant differentiation


(yy Pm + i,m): 
Hence, 
b —h.. == 


ij...k,l,m ij...k,m,l 


From (14.12) we then have 


(15.9) bap...» Bog = 0. 




Conversely, if the tensor $b_{ij\ldots k}$ and vector $\varphi_l$ satisfy (15.9) and
(15.10)
where $b_{ij\ldots k,l}$ is the covariant derivative of $b_{ij\ldots k}$, we have


ij... l,m ij 


and 
b 


ome 
ij. Keyl, m ml ij...k \P i,m 


l 


Since satisfies (15.9) 


so that 
or 
| 
a! 


and this last equation is the condition that $\varphi_l$ be the gradient of a scalar
function $\varphi(x^1, x^2, \ldots, x^n)$.

Hence, A necessary and sufficient condition for the existence of a covariant
tensor $a_{ij\ldots k}$ which satisfies (15.7) is that a covariant tensor $b_{ij\ldots k}$ and vector $\varphi_l$
exist which satisfy (15.9) and (15.10). If the tensor $b_{ij\ldots k}$ and vector $\varphi_l$ exist,
then

$a_{ij\ldots k} = e^{-\varphi}\,b_{ij\ldots k}.$


16. Algebraic condition for existence of first integrals of a particular
class. We shall now derive a condition on the functions $\Gamma$ for the
existence of a homogeneous first integral of the $k$th degree which satisfies the
particular condition (15.7). The condition is to involve only the algebraic
consistency of a set of tensor equations formed from the functions $\Gamma$. If the
covariant tensor of the $k$th order $a_{ij\ldots k}$ satisfies the condition (15.7) it follows
by (14.12) that $a_{ij\ldots k}$ will satisfy a sequence of equations of the form


Dif 7 == @, 


ij...klm,r;: 


id, 


j...klm,71, 


(16.1) 


ap... 
where 
of...7 arp... 
Dif ike Brim 
and $D^{\alpha\beta\ldots\gamma}_{ij\ldots klm,r_1\ldots r_n}$ represents the $n$th covariant derivative of $D^{\alpha\beta\ldots\gamma}_{ij\ldots klm}$. The
algebraic consistency of the equations (16.1) is a necessary condition on the $\Gamma$'s
for the existence of the homogeneous first integral of the $k$th degree which
satisfies (15.7).

The algebraic solutions of the equations (16.1) possess a tensor character.
For let $a_{ij\ldots k}$ represent an algebraic solution of (16.1). Under a general
transformation of coördinates the first set of equations of (16.1) becomes


( 16.2) i, 0, 


ij 


where is defined by the equations of transformation 


(16.3) 
onl OX 


41 


= 



and $\bar a_{\alpha\beta\ldots\gamma}$ represents an algebraic solution of (16.2). Substituting (16.3) in
the first set of equations of (16.1) we obtain


(16. ——.... —> = 0. 


If we multiply (16.4) by (0a4/0r/) --- and sum for 
(p.q,....#), then 


a ‘ ij...klm 
Ort 0g 


and a comparison of these equations with (16.2) shows that a solution of (16.2) 
is given by 


While we have considered the first set of equations of (16.1) a similar result 
would have been obtained with regard to any other set. Hence the algebraic 
solutions of (16.1) are tensors and it is consequently permissible to form the 
covariant derivative of these solutions as we shall do in the later work. 

Let us now assume the algebraic consistency of the equations (16.1) and
suppose that the first system of these equations admits a set of fundamental
solutions denoted by $b^{(p)}_{ij\ldots k}$, $p = 1, 2, \ldots, s$. The general solution of this
system of equations can then be expressed as a linear combination of the
fundamental solutions $b^{(p)}_{ij\ldots k}$ with arbitrary functional coefficients. We next
consider the first and second systems of equations (16.1) and suppose that
these equations have a fundamental set of solutions $c^{(p)}_{ij\ldots k}$, $p = 1, 2, \ldots, t$,
in which of course $s \geq t$. If $s = t$ then $c^{(p)}_{ij\ldots k}$, $p = 1, 2, \ldots, t$, will furnish
a fundamental set of solutions of the first system of equations which satisfies
the second system. If $s > t$ we consider the first three systems of equations,
which we may suppose to have a fundamental set of solutions $d^{(p)}_{ij\ldots k}$, $p = 1,
2, \ldots, u$, with the condition $t \geq u$. In case $t = u$ then $d^{(p)}_{ij\ldots k}$, $p = 1, 2,
\ldots, u$, will furnish a fundamental set of solutions of the first two systems of
equations which satisfies the third system. By proceeding in this way we
shall finally come to a point where the first $N$ systems of equations of (16.1)
will admit a fundamental set of solutions which satisfies the system
immediately following in the sequence. Hence to say that the equations (16.1) are
algebraically consistent implies that there is a number $N$ such that the first $N$
systems of equations (16.1) admit a fundamental set of solutions $a^{(p)}_{ij\ldots k}$, $p = 1,
2, \ldots, s$, which satisfies the equation


ap...7 == 


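The general solution of the first $N$ systems of equations is then

(16.5) $a_{ij\ldots k} = \varphi^{(\alpha)}\,a^{(\alpha)}_{ij\ldots k},$

where the expression on the right is summed for $\alpha$, and $\varphi^{(\alpha)}$ is an arbitrary
function of $(x^1, x^2, \ldots, x^n)$.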
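
The reason the process above must terminate can be stated compactly. In the sketch below $s_m$ is a symbol introduced here for illustration: the number of fundamental solutions common to the first $m$ systems of the sequence (16.1).

```latex
% Adding a further system of equations can only shrink the set of common solutions:
s_1 \;\ge\; s_2 \;\ge\; s_3 \;\ge\; \cdots \;\ge\; 0 .
% A non-increasing sequence of non-negative integers is eventually constant,
% so there is an N with
s_N = s_{N+1} ,
% i.e. every solution of the first N systems also satisfies the (N+1)th system.
% Algebraic consistency is the requirement that this common value be at least 1.
```

If the common value were zero the only algebraic solution would be the trivial one, and no first integral of the kind sought could arise from it.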

Before proceeding further with the general case let us consider the particular
case where the first system of equations (16.1) has a unique solution $a_{ij\ldots k}$
which satisfies the second system, i. e.,


(16.6) 
up...7 


ij...klm,7 


Under these conditions a homogeneous first integral of the $k$th degree will
exist whose covariant derivative vanishes. For if we differentiate the first
system of equations (16.1) covariantly we obtain, on account of (16.6),


ap...7 


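where $a_{\alpha\beta\ldots\gamma,l}$ is the covariant derivative of $a_{\alpha\beta\ldots\gamma}$. Since (16.7) possesses
a unique solution $a_{ij\ldots k}$ it follows that

$a_{ij\ldots k,l} = a_{ij\ldots k}\, \varphi_l$,

in which $\varphi_l$ is a covariant vector. The above statement then follows from the
last theorem of § 15.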

Going back to the general case let us substitute one of the fundamental
solutions $a^{(p)}_{ij\ldots k}$ in the equations of the sequence (16.1) through the $(N+1)$th.
We may then differentiate these equations covariantly so as to obtain the
following:


41" 


= 




(p) ap...7 
(p) ap...7 
ep. ..7,7 Dy. = 0, 
(16.8) 
(p) Pr. 
( 
where $a^{(p)}_{\alpha\beta\ldots\gamma,l}$ is the covariant derivative of $a^{(p)}_{\alpha\beta\ldots\gamma}$. Since $a^{(p)}_{ij\ldots k,l}$ is a solution
of (16.8) it may be expressed linearly in terms of the fundamental solutions
of these equations. Hence


(16.9) $a^{(p)}_{ij\ldots k,l} = \lambda^{(p\alpha)}_l\, a^{(\alpha)}_{ij\ldots k}$,


where the expression on the right is summed for $\alpha$, and the $\lambda$'s are covariant
vectors. Since $a^{(p)}_{ij\ldots k}$ satisfies the first system of the sequence (16.1),


(16.10) $a^{(p)}_{ij\ldots k,l,m} - a^{(p)}_{ij\ldots k,m,l} = 0$.


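If $a^{(p)}_{ij\ldots k,l}$ as given by (16.9) be differentiated covariantly and substituted in
(16.10) there is obtained the following condition on the $\lambda$'s


(16.11) $\lambda^{(p\alpha)}_{l,m} - \lambda^{(p\alpha)}_{m,l} + \lambda^{(p\beta)}_l \lambda^{(\beta\alpha)}_m - \lambda^{(p\beta)}_m \lambda^{(\beta\alpha)}_l = 0$.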


If we substitute (16.5) in (15.7) we see that it will be satisfied if a set of
functions $\varphi^{(p)}$, $p = 1, 2, \ldots, s$, can be chosen so as to satisfy the equations


(16.12) $\frac{\partial \varphi^{(\alpha)}}{\partial x^l} + \varphi^{(\beta)} \lambda^{(\beta\alpha)}_l = 0$.


Such a set of functions can be chosen, for in consequence of (16.11) these
equations are completely integrable. This set of functions $\varphi^{(p)}$ will determine
according to (16.5) a covariant tensor of the kth degree $a_{ij\ldots k}$ whose covariant
derivative vanishes.
Hence, a necessary and sufficient condition for the existence of a homogeneous
first integral of the kth degree $a_{ij\ldots k}$ which satisfies (15.7) is that there exists
a number $N$ such that the first $N$ systems of equations (16.1) admit a fundamental
set of $s$ solutions ($s \geq 1$) which satisfy the $(N+1)$th system of equations.
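
The criterion just stated can be illustrated at a single point, where each system of the sequence reduces to a finite set of linear homogeneous equations with numerical coefficients. The following is only a minimal sketch under that assumption: the names `nullspace` and `first_stable_N` and the three toy systems are hypothetical and do not come from the paper, and the sketch ignores the dependence of the coefficients on the coordinates. It stacks the first N systems, computes a fundamental set of solutions by exact rational elimination, and reports the first N for which every fundamental solution also satisfies system N+1.

```python
from fractions import Fraction


def nullspace(rows, n):
    """Basis of {x : rows . x = 0} over the rationals (Gauss-Jordan elimination)."""
    m = [[Fraction(v) for v in r] for r in rows]
    pivots = []
    r = 0
    for c in range(n):
        p = next((i for i in range(r, len(m)) if m[i][c] != 0), None)
        if p is None:
            continue
        m[r], m[p] = m[p], m[r]
        piv = m[r][c]
        m[r] = [v / piv for v in m[r]]
        for i in range(len(m)):
            if i != r and m[i][c] != 0:
                f = m[i][c]
                m[i] = [a - f * b for a, b in zip(m[i], m[r])]
        pivots.append(c)
        r += 1
    basis = []
    for free in (c for c in range(n) if c not in pivots):
        v = [Fraction(0)] * n
        v[free] = Fraction(1)
        for row_idx, pc in enumerate(pivots):
            v[pc] = -m[row_idx][free]
        basis.append(v)
    return basis


def first_stable_N(systems, n):
    """Smallest N such that a fundamental set of solutions of systems 1..N
    satisfies system N+1; returns (N, s) with s the size of that set, or None."""
    stacked = []
    for N in range(1, len(systems)):
        stacked = stacked + systems[N - 1]
        basis = nullspace(stacked, n)
        nxt = systems[N]
        if all(sum(a * b for a, b in zip(row, v)) == 0 for row in nxt for v in basis):
            return N, len(basis)
    return None


# Hypothetical toy sequence in three unknowns: x1 = 0; x2 = 0; x1 + x2 = 0.
toy = [[[1, 0, 0]], [[0, 1, 0]], [[1, 1, 0]]]
result = first_stable_N(toy, 3)
```

In this toy sequence the solutions of the first system alone do not all satisfy the second, but the solutions of the first two satisfy the third, so the sketch returns N = 2 with s = 1. The condition s ≥ 1 of the theorem must still be checked by the caller.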

17. Special cases. The theorems of the last two sections have some
interesting applications in the linear and quadratic cases. It is natural to
define a field of parallel covariant vectors by means of a set of functions $h_i$
such that


(17.1) $h_{i,j} = 0$.


For this means that if normal coördinates are introduced with origin at an
arbitrary point, we have at this point

adh; dy“ 

ds ds 

By the third theorem in italics in § 15, a necessary and sufficient condition
for the existence of a field of parallel covariant vectors is the existence of
a function $\varphi$ and vector $A_i$ such that*


The last theorem of § 15 now shows that a necessary and sufficient condition
for a field of parallel covariant vectors $h_i$ is that a covariant vector $A_i$ exist
which satisfies the equations


(17.4) $A_{i,j} = A_i \varphi_j$,
(17.5) $A_\alpha B^\alpha_{ijk} = 0$,


where $\varphi_j$ is a covariant vector and $A_{i,j}$ is the covariant derivative of $A_i$. The
theorem of § 16 shows that a necessary and sufficient condition for a field of


* Eisenhart, Proceedings of the National Academy of Sciences, vol. 8 (1922),
pp. 207-212, defines $A_i$ as a field of parallel vectors, and finds the condition (17.5) for
their existence.




parallel covariant vectors is that there exists a number $N$ such that the first $N$
sets of equations of the sequence


(17.6)
$A_\alpha B^\alpha_{ijk} = 0$,
$A_\alpha B^\alpha_{ijk,l} = 0$,
$A_\alpha B^\alpha_{ijk,l,m} = 0$,
. . . . . . . . . .


admit a fundamental set of $s$ solutions ($s \geq 1$) which satisfy the $(N+1)$th
set. In particular a sufficient condition is obtained if the first system of
equations of (17.6) be algebraically consistent and all their solutions satisfy
the second system of these equations.

Going now to the quadratic case we see from the third theorem in italics
in § 15 that the condition on the functions $\Gamma$ for the geometry of paths to
become a Riemann geometry is that a tensor $g_{ij}$ exist such that


The equation (17.7) without the condition that the vector $\varphi_k$ be the gradient
of a scalar function gives the geometry upon which Weyl bases his electromagnetic
and gravitational theory, for this equation is equivalent to the
equation (2.10). By the last theorem of § 15, the condition (17.7) can be
written


(17.8) $g_{ij,k} = g_{ij} \varphi_k$,


(17.9) $g_{\alpha j} B^\alpha_{ikl} + g_{i\alpha} B^\alpha_{jkl} = 0$.


This shows furthermore that a necessary and sufficient condition for the Weyl
geometry to become the Riemann geometry is that the tensor $g_{ij}$ satisfy (17.9).

The theorem of § 16 shows that a necessary and sufficient condition for
the geometry of paths to become a Riemann geometry is that there exists
a number $N$ such that the first $N$ systems of equations of the following sequence
admit a fundamental set of $s$ solutions ($s \geq 1$) which satisfy the $(N+1)$st
system of equations:


(17.10)
$g_{\alpha j} B^\alpha_{ikl} + g_{i\alpha} B^\alpha_{jkl} = 0$,
$g_{\alpha j} B^\alpha_{ikl,m} + g_{i\alpha} B^\alpha_{jkl,m} = 0$,
$g_{\alpha j} B^\alpha_{ikl,m,n} + g_{i\alpha} B^\alpha_{jkl,m,n} = 0$,
. . . . . . . . . .


In particular* we have that a sufficient condition for the geometry of paths
to become a Riemann geometry is that the equations

$g_{\alpha j} B^\alpha_{ikl} + g_{i\alpha} B^\alpha_{jkl} = 0$

be algebraically consistent and that all their solutions satisfy

$g_{\alpha j} B^\alpha_{ikl,m} + g_{i\alpha} B^\alpha_{jkl,m} = 0$.
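
As a small numerical illustration of the first system of this sequence, the sketch below evaluates $g_{\alpha j} B^\alpha_{ikl} + g_{i\alpha} B^\alpha_{jkl}$ at one point of a two-dimensional space. It is a sketch under stated assumptions, not the paper's computation: the curvature components are taken to have the constant-curvature form $B^\alpha_{ikl} = K(\delta^\alpha_k m_{il} - \delta^\alpha_l m_{ik})$ for a given symmetric matrix `m`, the overall sign convention of B is an assumption (it does not affect a homogeneous condition), and the names `constant_curvature_B` and `residual` are hypothetical.

```python
def constant_curvature_B(m, K=1):
    """B[a][i][k][l] = K * (delta(a,k) * m[i][l] - delta(a,l) * m[i][k]).
    Assumed model form of the curvature at one point; sign convention is an assumption."""
    n = len(m)
    d = lambda a, b: 1 if a == b else 0
    return [[[[K * (d(a, k) * m[i][l] - d(a, l) * m[i][k])
               for l in range(n)] for k in range(n)] for i in range(n)] for a in range(n)]


def residual(g, B):
    """Largest absolute value of g_{aj} B^a_{ikl} + g_{ia} B^a_{jkl} over all i, j, k, l."""
    n = len(g)
    worst = 0
    for i in range(n):
        for j in range(n):
            for k in range(n):
                for l in range(n):
                    val = sum(g[a][j] * B[a][i][k][l] + g[i][a] * B[a][j][k][l]
                              for a in range(n))
                    worst = max(worst, abs(val))
    return worst


m = [[1, 0], [0, 1]]                      # metric values at the chosen point
B_curved = constant_curvature_B(m, K=1)   # constant curvature model
B_flat = constant_curvature_B(m, K=0)     # vanishing curvature

res_metric = residual(m, B_curved)                 # the metric itself satisfies the condition
res_other = residual([[1, 0], [0, 2]], B_curved)   # a non-proportional tensor does not
res_flat = residual([[1, 0], [0, 2]], B_flat)      # with B = 0 the condition is vacuous
```

With nonzero curvature only multiples of `m` pass the test at this point, whereas in the flat case every symmetric tensor passes, which is why the later systems of the sequence are then needed to single out admissible solutions.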
18. The homogeneous linear first integral. From the first theorem
of § 15 it follows that a necessary and sufficient condition for the covariant
vector $h_i$ to furnish a linear first integral,


(18.1) $h_\alpha \frac{dx^\alpha}{ds} = \text{constant}$,

is that the equation

(18.2) $h_{i,j} + h_{j,i} = 0$

be satisfied, i. e., the covariant derivative $h_{i,j}$ must be skew symmetric in the
indices $i$ and $j$. The equations (14.12) show that


(18.3) $h_{i,j,k} - h_{i,k,j} = h_\alpha B^\alpha_{ikj}$.


* Eisenhart and Veblen, Proceedings of the National Academy of Sciences,
vol. 8 (1922), pp. 19-23.


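By (18.2) these give rise to


(18.4)
$h_{i,j,k} + h_{k,i,j} = h_\alpha B^\alpha_{ikj}$,
$h_{k,i,j} + h_{j,k,i} = h_\alpha B^\alpha_{kji}$,
$h_{j,k,i} + h_{i,j,k} = h_\alpha B^\alpha_{jik}$.


If we add these three equations we obtain, since the cyclic sum
$B^\alpha_{ikj} + B^\alpha_{kji} + B^\alpha_{jik}$ vanishes,

(18.5) $h_{i,j,k} + h_{j,k,i} + h_{k,i,j} = 0$.

Combining (18.5) with the second equation of (18.4), and using the skew symmetry
of $B^\alpha_{kji}$ in its last two indices, we have also

(18.6) $h_{i,j,k} = h_\alpha B^\alpha_{kij}$.


These are integrability conditions obtained by consideration of second derivatives.
In order to obtain those involving third derivatives we use (13.12),
which with (18.2) gives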


« 
hi, par Ny, iyi 2 hea Aipr + 2 he. r Aing 2 he Aipgr- 


rs «“ ¢ « 
(1 8.7) Ng, ipr hi. qpr he. Agir + 2 he, r Agip Zhe Agipr 


hyp, gir ha, pir = 2 Aner 2 her Angi Sh. Apgir 


If we add the first two of these equations and subtract the third, we obtain 


« « 
hi, par he, p Aigr he, q A ior he.i Aner 


(18.8) 
he,r ( Aing + Apai) + he + Aneir 




Now interchange the indices $q$ and $r$ and subtract the resulting equation from
this one. We obtain


(18.9) Borg + he,p + he,r Brip + he ( + Brri,g) = 9. 
If we collect the terms in the equation (18.9) we have 


(18.10) $h_\alpha C^\alpha + h_{\alpha,\beta} D^{\alpha\beta} = 0$,
where 


(0; + OF + 05 6; 0; + df ). 


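$C^\alpha$ is a tensor which is contravariant of the first order and covariant of the
fourth, $D^{\alpha\beta}$ is a tensor which is contravariant of the second order and
covariant of the fourth. The covariant indices of these tensors have been
omitted for simplicity. If we differentiate (18.10) covariantly, we obtain


$h_{\alpha,i} C^\alpha + h_\alpha C^\alpha_{,i} + h_{\alpha,\beta,i} D^{\alpha\beta} + h_{\alpha,\beta} D^{\alpha\beta}_{,i} = 0$,


and this becomes


(18.11) $h_{\alpha,i} C^\alpha + h_\alpha C^\alpha_{,i} + h_\gamma B^\gamma_{i\alpha\beta} D^{\alpha\beta} + h_{\alpha,\beta} D^{\alpha\beta}_{,i} = 0$


when we make the substitution (18.6). This equation may be written in an
abbreviated form as follows:


(18.12) $h_\alpha C_1^\alpha + h_{\alpha,\beta} D_1^{\alpha\beta} = 0$.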


Covariant differentiation of (18.12) will give rise to a new equation which
can in its turn be abbreviated to the form (18.10), this process requiring the
use of (18.6) to eliminate $h_{\alpha,\beta,\gamma}$. Continuing in this way we obtain an infinite
sequence of equations. For the purpose of convenient reference we shall
write this sequence with the equation (18.2) as the first member:




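(18.13)
$h_{i,j} + h_{j,i} = 0$,
$h_\alpha C^\alpha + h_{\alpha,\beta} D^{\alpha\beta} = 0$,
$h_\alpha C_1^\alpha + h_{\alpha,\beta} D_1^{\alpha\beta} = 0$,
. . . . . . . . . .
$h_\alpha C_n^\alpha + h_{\alpha,\beta} D_n^{\alpha\beta} = 0$,
. . . . . . . . . .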


The algebraic consistency of this set of equations, regarded as equations
for the determination of $h_i$ and $h_{i,j}$, is a necessary condition for the existence
of a first integral $h_i$. Hence as in § 16 there must be a value of $N$ such
that the first $N+1$ sets of equations admit a fundamental set of solutions
$h_i^{(p)}$, $h_{ij}^{(p)}$ ($p = 1, 2, \ldots, s$) each of which will satisfy the system of equations
next following in the sequence (18.13). This necessary condition turns out
also to be a sufficient condition.

Before proving this in general, let us consider the special case in which
$N = 1$ and $s = 1$. In this case (18.2) and (18.10) are consistent and
possess a unique algebraic solution consisting of a set of functions $h_i$,
$i = 1, 2, \ldots, n$, and a set of functions $h_{ij}$, $i, j = 1, 2, \ldots, n$, which
satisfy (18.11). It may now be shown that the solutions $h_i$ and $h_{ij}$ are tensors,
so that it is possible to substitute these quantities in the equation (18.10)
and differentiate it covariantly. Doing this we obtain


(18.14) $h_{\alpha,i} C^\alpha + h_\alpha C^\alpha_{,i} + h_{\alpha\beta,i} D^{\alpha\beta} + h_{\alpha\beta} D^{\alpha\beta}_{,i} = 0$.


If we subtract (18.14) from (18.11) with $h_{ij}$ replacing $h_{i,j}$, we obtain


(18.15) $(h_{\alpha i} - h_{\alpha,i}) C^\alpha + (h_\gamma B^\gamma_{i\alpha\beta} - h_{\alpha\beta,i}) D^{\alpha\beta} = 0$.




In (18.15) the coefficient of $D^{\alpha\beta}$ is skew symmetric in the indices $\alpha$ and $\beta$.
By hypothesis, the solution of (18.15) and (18.2) is unique and consequently
the solution $(h_{\alpha i} - h_{\alpha,i})$, $(h_\gamma B^\gamma_{i\alpha\beta} - h_{\alpha\beta,i})$ can only differ from the solution
$h_\alpha$, $h_{\alpha\beta}$ by a factor of multiplication. Hence


(18.16) $h_{i,j} - h_{ij} = \varphi_j h_i$,


(18.17) $h_{ij,k} - h_\alpha B^\alpha_{kij} = \varphi_k h_{ij}$,


where $\varphi_j$ is a covariant vector. If we differentiate (18.16) covariantly,
obtaining


$h_{i,j,k} - h_{ij,k} = \varphi_{j,k} h_i + \varphi_j h_{i,k}$,

and from this form the expression

$(\varphi_{j,k} - \varphi_{k,j}) h_i = h_{i,j,k} - h_{i,k,j} + h_{ik,j} - h_{ij,k} + \varphi_k h_{i,j} - \varphi_j h_{i,k}$,


we find on substituting the equations (18.16) and (18.17) in the right member
of this equation that it vanishes identically. The functions $h_i$ are not all
identically zero, for if so it would follow by (18.16) that the functions $h_{ij}$ are
also identically zero, contrary to the assumption that (18.2) and (18.10) are
algebraically consistent. Hence


$\varphi_{j,k} - \varphi_{k,j} = 0$.

The vector $\varphi_j$ is therefore the gradient of a scalar function $\varphi$, i.e.,


$\varphi_j = \frac{\partial \varphi}{\partial x^j}$.


Now we shall have a first integral if a function $\psi$ exists such that


(18.18) $(\psi h_i)_{,j} = \psi h_{ij}$,




where $(\psi h_i)_{,j}$ denotes the covariant derivative of $\psi h_i$. For if this equation
is satisfied, $\psi h_i$ will be a covariant vector satisfying (18.2) and hence will
give a first integral. Expanding (18.18)


$\frac{\partial \psi}{\partial x^j} h_i + \psi h_{i,j} = \psi h_{ij}$,

or

(18.19) $h_{i,j} - h_{ij} = \psi_j h_i$,

where

$\psi_j = -\frac{\partial \log \psi}{\partial x^j}$.


The gradient $\psi_j$ is a covariant vector and consequently (18.19) will be satisfied
if we put


$\psi = e^{-\varphi}$.


Hence, a sufficient condition for the existence of a linear first integral is 
that (18.2) and (18.10) be algebraically consistent and that they possess 
a unique solution which satisfies (18.11). 
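
The step from the vanishing of $\varphi_{j,k} - \varphi_{k,j}$ to the existence of a scalar function whose gradient is $\varphi_j$ can be checked on a concrete field. The sketch below is only an illustration under assumptions: the vector field `phi_vec` is a hypothetical example chosen for the purpose (it is not taken from the paper), the space is two-dimensional, and use is made of the fact that for a symmetric connection the $\Gamma$ terms cancel in $\varphi_{j,k} - \varphi_{k,j}$, so that ordinary partial derivatives suffice. The scalar is recovered by integrating $\varphi_j \, dx^j$ along the straight segment from the origin.

```python
from fractions import Fraction


def phi_vec(x, y):
    """Hypothetical covariant vector: the gradient of x**2 * y + 3 * y."""
    return (2 * x * y, x * x + 3)


def curl(x, y):
    """phi_{1,2} - phi_{2,1} by central differences, which are exact here
    because both components are polynomials of degree at most two."""
    d1_dy = Fraction(phi_vec(x, y + 1)[0] - phi_vec(x, y - 1)[0], 2)
    d2_dx = Fraction(phi_vec(x + 1, y)[1] - phi_vec(x - 1, y)[1], 2)
    return d1_dy - d2_dx


def potential(x, y):
    """Integrate phi_j dx^j along t -> (t*x, t*y), 0 <= t <= 1, by Simpson's rule;
    the integrand is quadratic in t, so a single Simpson panel is exact."""
    def integrand(t):
        px, py = phi_vec(t * x, t * y)
        return px * x + py * y
    return (integrand(Fraction(0)) + 4 * integrand(Fraction(1, 2)) + integrand(Fraction(1))) / 6


curl_values = [curl(x, y) for x in range(-2, 3) for y in range(-2, 3)]
phi_at_point = potential(2, 5)
```

Once such a scalar has been recovered, the multiplier of the special case is obtained from it by exponentiation, as in the text.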

Let us now return to the general case and assume that there is a value
of $N$ such that the first $N+1$ systems of equations (18.13) admit a fundamental
system of solutions $h_i^{(p)}$, $h_{ij}^{(p)}$, $p = 1, 2, \ldots, s$, each of which satisfies
the system of equations immediately following in the sequence. By the same
argument as before, $h_i^{(p)}$ and $h_{ij}^{(p)}$ are tensors for all values of $p$. The general
solution of the first $N+1$ systems of equations is then


(18.20) $h_i = \varphi^{(\alpha)} h_i^{(\alpha)}$,
(18.21) $h_{ij} = \varphi^{(\alpha)} h_{ij}^{(\alpha)}$,

where the terms on the right are summed for $\alpha$ from $\alpha = 1$ to $\alpha = s$. If we
differentiate the equation


(18.22) $h_\alpha^{(p)} C^\alpha + h_{\alpha\beta}^{(p)} D^{\alpha\beta} = 0$




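covariantly, and subtract it from the equation


(18.23) $h_\alpha^{(p)} C_1^\alpha + h_{\alpha\beta}^{(p)} D_1^{\alpha\beta} = 0$,

we obtain

$(h_{\alpha i}^{(p)} - h_{\alpha,i}^{(p)}) C^\alpha + (h_\gamma^{(p)} B^\gamma_{i\alpha\beta} - h_{\alpha\beta,i}^{(p)}) D^{\alpha\beta} = 0$.

If we next differentiate (18.23) covariantly and subtract it from the equation
immediately following in the sequence, we have

$(h_{\alpha i}^{(p)} - h_{\alpha,i}^{(p)}) C_1^\alpha + (h_\gamma^{(p)} B^\gamma_{i\alpha\beta} - h_{\alpha\beta,i}^{(p)}) D_1^{\alpha\beta} = 0$.

Continuing in this way we obtain the equations

(18.24)
$(h_{\alpha i}^{(p)} - h_{\alpha,i}^{(p)}) C^\alpha + (h_\gamma^{(p)} B^\gamma_{i\alpha\beta} - h_{\alpha\beta,i}^{(p)}) D^{\alpha\beta} = 0$,
$(h_{\alpha i}^{(p)} - h_{\alpha,i}^{(p)}) C_1^\alpha + (h_\gamma^{(p)} B^\gamma_{i\alpha\beta} - h_{\alpha\beta,i}^{(p)}) D_1^{\alpha\beta} = 0$,
. . . . . . . . . .
$(h_{\alpha i}^{(p)} - h_{\alpha,i}^{(p)}) C_{N-1}^\alpha + (h_\gamma^{(p)} B^\gamma_{i\alpha\beta} - h_{\alpha\beta,i}^{(p)}) D_{N-1}^{\alpha\beta} = 0$.

The term $(h_\gamma^{(p)} B^\gamma_{i\alpha\beta} - h_{\alpha\beta,i}^{(p)})$ is skew symmetric in the indices $\alpha$ and $\beta$, and
we may therefore express the quantities $(h_{\alpha i}^{(p)} - h_{\alpha,i}^{(p)})$, $(h_\gamma^{(p)} B^\gamma_{i\alpha\beta} - h_{\alpha\beta,i}^{(p)})$ as
a linear combination of the particular solutions $h_i^{(p)}$, $h_{ij}^{(p)}$, $p = 1, 2, \ldots, s$:

(18.25) $h_{i,p}^{(k)} - h_{ip}^{(k)} = \lambda^{(k\alpha)}_p h_i^{(\alpha)}$,


(18.26) $h_{ip,q}^{(k)} - h_\alpha^{(k)} B^\alpha_{qip} = \lambda^{(k\alpha)}_q h_{ip}^{(\alpha)}$.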




To determine the condition which the covariant vectors $\lambda^{(k\alpha)}_p$ must satisfy we
differentiate (18.25) covariantly, obtaining


(k) pike ) 7(@ (heat) 7(@) 
he = q +4 h; An hy. q? 
or 
k) 2th) pa 9 (kee) 7,(@) (ke) (kee) ¢7(@) 1 9(@p) 


If we interchange p and q in (18.27) and subtract these two equations we 
find that 


(18.28) ( — 4. 408) — Q, 


We next differentiate (18.26) covariantly, 


(k) (k) (k) « pm (a@) 
hip, — he. r Boip he Brin, r Su, r 4 ho hip, 
or 
(k (kp) k) ¢ (ke) 
a. = = he ) Brip + Brin, + r 
(18.29) 


Interchanging $r$ and $q$ in (18.29) and subtracting the two equations,


is «) — ates 4. axe kA) 4 ‘)) 


(18.30) 
+ ( Boip,r — Brin, +e Br + Bi» = 0. 


This equation reduces to 


ip 4,7 4 4 


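since $h_i$, $h_{ij}$ is a solution of (18.9). From (18.28) and (18.31) we now
deduce that


(18.32) $\lambda^{(k\alpha)}_{p,q} - \lambda^{(k\alpha)}_{q,p} + \lambda^{(k\beta)}_p \lambda^{(\beta\alpha)}_q - \lambda^{(k\beta)}_q \lambda^{(\beta\alpha)}_p = 0$,


for if (18.32) were not satisfied there would be a linear relation among the
solutions contrary to the hypothesis that $h_i^{(k)}$, $h_{ij}^{(k)}$, $k = 1, 2, \ldots, s$,
is a fundamental set of solutions.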

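A linear first integral $h_i$ will be determined by (18.20) if $\varphi^{(\alpha)}$ can be chosen
so that the equations


(18.33) $(\varphi^{(\alpha)} h_i^{(\alpha)})_{,p} = \varphi^{(\alpha)} h_{ip}^{(\alpha)}$


are satisfied, where the term on the left is the covariant derivative of $\varphi^{(\alpha)} h_i^{(\alpha)}$,
for the covariant vector $h_i = \varphi^{(\alpha)} h_i^{(\alpha)}$ will possess a covariant derivative $h_{i,p}$
which is skew symmetric in $i$ and $p$. Expanding (18.33)


(18.34) $\frac{\partial \varphi^{(\alpha)}}{\partial x^p} h_i^{(\alpha)} + \varphi^{(\alpha)} (h_{i,p}^{(\alpha)} - h_{ip}^{(\alpha)}) = 0$.


From (18.25) we find that the condition on the $\varphi$'s can be put in the form

(18.35) $\frac{\partial \varphi^{(\alpha)}}{\partial x^p} + \varphi^{(\beta)} \lambda^{(\beta\alpha)}_p = 0$.


The integrability conditions of (18.35) are the equations (18.32) and hence
a set of $\varphi$'s can be found which will satisfy (18.33).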

Hence, a necessary and sufficient condition for the existence of a linear first
integral (18.1) is that the $\Gamma$'s be such that there exists a number $N$ such that
the first $N+1$ systems of equations (18.13) admit a fundamental set of $s$
solutions ($s \geq 1$) which satisfies the $(N+2)$nd system of equations.
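
A simple instance of a linear first integral (18.1) may help fix ideas. The sketch below assumes the Euclidean plane in Cartesian coordinates, so that all the $\Gamma$'s vanish, the paths are straight lines, and covariant derivatives are ordinary partial derivatives. The vector $h = (-y, x)$ is a hypothetical choice made for illustration and does not appear in the paper. The sketch checks the skew symmetry condition (18.2) and the constancy of $h_\alpha \, dx^\alpha/ds$ along one straight path.

```python
def h(x, y):
    """Hypothetical covariant vector h_i = (-y, x) on the Euclidean plane."""
    return (-y, x)


def symmetrized_derivative(x, y):
    """h_{i,j} + h_{j,i} with h_{i,j} = d h_i / d x^j (all Gamma = 0);
    unit forward differences are exact because h is linear."""
    steps = ((1, 0), (0, 1))
    dh = [[h(x + steps[j][0], y + steps[j][1])[i] - h(x, y)[i] for j in range(2)]
          for i in range(2)]
    return [[dh[i][j] + dh[j][i] for j in range(2)] for i in range(2)]


def linear_integral(point, velocity):
    """Value of h_alpha dx^alpha/ds at a point of a path with the given velocity."""
    hx, hy = h(*point)
    return hx * velocity[0] + hy * velocity[1]


def straight_path(p0, v, s):
    """Point of the straight path through p0 with constant velocity v at parameter s."""
    return (p0[0] + v[0] * s, p0[1] + v[1] * s)


p0, v = (2, 1), (3, -4)
killing_check = symmetrized_derivative(5, -7)
values = [linear_integral(straight_path(p0, v, s), v) for s in range(-3, 4)]
```

The list `values` contains the same number at every sampled parameter value, which is the statement (18.1) for this particular vector.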

19. The homogeneous quadratic first integral. A necessary and
sufficient condition for the existence of a homogeneous quadratic first integral


(19.1) $g_{\alpha\beta} \frac{dx^\alpha}{ds} \frac{dx^\beta}{ds} = \text{constant}$

such that

(19.2) $g_{ij} = g_{ji}$

is that $g_{ij}$ satisfy the condition


(19.3) $g_{ij,p} + g_{jp,i} + g_{pi,j} = 0$,




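where $g_{ij,p}$ is the covariant derivative of $g_{ij}$. By differentiating (19.3)
covariantly, we obtain

$g_{ij,p,q} + g_{jp,i,q} + g_{pi,j,q} = 0$,
$g_{ij,p,q,r} + g_{jp,i,q,r} + g_{pi,j,q,r} = 0$,
$g_{ij,p,q,r,s} + g_{jp,i,q,r,s} + g_{pi,j,q,r,s} = 0$.

By substituting (12.4), (12.5) and (12.6) in these three equations we obtain


(19.4) $g_{ij,pq} + g_{jp,iq} + g_{pi,jq} = P_{ijpq}$,
(19.5) $g_{ij,pqr} + g_{jp,iqr} + g_{pi,jqr} = P_{ijpqr}$,
(19.6) $g_{ij,pqrs} + g_{jp,iqrs} + g_{pi,jqrs} = P_{ijpqrs}$,
where


(19.7) $P_{ijpq} = 2 (g_{i\alpha} A^\alpha_{jpq} + g_{j\alpha} A^\alpha_{piq} + g_{p\alpha} A^\alpha_{ijq})$,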


« 
Pijpgr 2 (Gie,g Ajpr + Apir + Gpa,q Aijr 
«“ 
( 19.8) + Gia, Ajng + Gia, re A nig + Gpe, r 


Ajngr A nigr Gpe Aijor }, 


a « «a 
Pijpors 2 ( qr Ajns T Gia,rs 4 Ling + ie. sq Ajpr 
Die, qr Ains Gie, rs Ainy Dia, 8q Aipr 
' « a | 
+ Ype,qr Aijs + Ype,rs Aiig + Gpe,sq Aiji 
(19.9) + Gie,g Ajprs + Jie,r Ajpgs + Yia,s Ajpar 
Jje,q Aiprs + Aings T Dija,s Aipg 


@ 
+ Gpe,q Aijrs + Aijgs Gpe, s Aiin 


+ Jiu Ajpgrs + die Aipars + pe Aijgr ). 




The $P$'s are tensors which are symmetric in the first three indices and also
in the remaining ones. Thus


(19.10)
$P_{ijpq} = P_{jipq} = P_{ipjq}$,
$P_{ijpqr} = P_{jipqr} = P_{ijprq}$, etc.,
$P_{ijpqrs} = P_{jipqrs} = P_{ijpqsr}$, etc.


The equations (19.4) can not be solved for $g_{ij,pq}$ but may be solved for the
difference of two of these extensions, namely


(19.11) $g_{ij,pq} - g_{pq,ij} = \frac{1}{2} (P_{ijpq} + P_{ijqp} - P_{pqij} - P_{pqji})$.

We may however solve* (19.5) for $g_{ij,pqr}$, thus


1 
YGij,pqr 3 (Piipgr + Pijqpr + Pijrpq + Pygrij) 
(19.12) 


= 


1 
(Pivair + Pirnia + + Pipair + Pirvig + 


Similarly from (19.6) 


1 
pars 3 ( Pijpors Pijqprs + Pirpgs + Parijs) 
; (19.13) 


i 6 Pipairs + P, irpjqs ignrjps + P pairs Pirpigs +2 ) « 

The equations (19.11) and (19.12) constitute integrability conditions arising
from second and third derivatives respectively. By interchanging $r$ and $s$ in
(19.13) and subtracting we obtain the integrability conditions arising from
the fourth derivatives:


(2 Pijraps + 2 Progrijs + Pispjar + Pigsjpr + Pispiqr + Pyosipr ) 
(19.14)


(2 Pisopr + 2 Pogsijr + Pirpjgs + Pigrips + Pirpigs + Pigqrips) = 0. 


*The solution is facilitated by noticing that the tensors in the left member of (19.5) 
can be regarded as notation for the vertices of a Desargues configuration. 




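If we substitute the value of $P_{ijpqrs}$ given by (19.9) in (19.14) we obtain an
equation which we may write in the form


(19.15) $g_{\alpha\beta} U^{\alpha\beta}_{ijpqrs} + g_{\alpha\beta,\gamma} V^{\alpha\beta\gamma}_{ijpqrs} + g_{\alpha\beta,\gamma\delta} W^{\alpha\beta\gamma\delta}_{ijpqrs} = 0$,

where $U$, $V$, and $W$ are tensors.

Let us next consider the identity

(19.16) $g_{ij,p,q} = g_{ij,pq} - g_{\alpha j} A^\alpha_{ipq} - g_{i\alpha} A^\alpha_{jpq}$


(cf. (12.4)). The third covariant derivative $g_{ij,p,q,r}$ may be evaluated in terms of
$g_{ij}$ and $g_{ij,p}$ by setting the value of $g_{ij,pqr}$ given by (19.12) in the identity (12.5):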


Yij.p,q,r = Yei.p Ajrg Yai,g Arp T Ajap YJaj,p Arig Yaj,a 
+ Gir Agip + Gap,i Agri + Agri + + Aija 
Jagq,i + Gag, j Jaq, p Air + Ajjp Apyj 
(19.17) + Jar, j + Jar,p Jar, Aijp — Gij,« 
+ Yee | ym + Ajrpq) + Geaj ( Abipr + Ariqp ) 
T Jap ( Abrij) + ( Aprij ) 


a a 
Jar ( Aijing A De 


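If we differentiate both members of (19.16) covariantly and substitute for
$g_{ij,p,q,r}$ its value from (19.17), we obtain an equation which we may write as


(19.18) $g_{ij,pq,r} = g_{\alpha\beta} E^{\alpha\beta}_{ijpqr} + g_{\alpha\beta,\gamma} F^{\alpha\beta\gamma}_{ijpqr}$,

where $E$ and $F$ are tensors. Next differentiate (19.15) covariantly, obtaining

(19.19) $g_{\alpha\beta,i} U^{\alpha\beta} + g_{\alpha\beta} U^{\alpha\beta}_{,i} + g_{\alpha\beta,\gamma,i} V^{\alpha\beta\gamma} + g_{\alpha\beta,\gamma} V^{\alpha\beta\gamma}_{,i} + g_{\alpha\beta,\gamma\delta,i} W^{\alpha\beta\gamma\delta} + g_{\alpha\beta,\gamma\delta} W^{\alpha\beta\gamma\delta}_{,i} = 0$,


in which we have omitted the covariant indices for simplicity. When we
make the substitutions (19.16) and (19.18) this equation becomes


(19.20) $g_{\alpha\beta,i} U^{\alpha\beta} + g_{\alpha\beta} U^{\alpha\beta}_{,i} + (g_{\alpha\beta,\gamma i} - g_{\sigma\beta} A^\sigma_{\alpha\gamma i} - g_{\alpha\sigma} A^\sigma_{\beta\gamma i}) V^{\alpha\beta\gamma} + g_{\alpha\beta,\gamma} V^{\alpha\beta\gamma}_{,i}$
$\qquad + (g_{\sigma\tau} E^{\sigma\tau}_{\alpha\beta\gamma\delta i} + g_{\sigma\tau,\rho} F^{\sigma\tau\rho}_{\alpha\beta\gamma\delta i}) W^{\alpha\beta\gamma\delta} + g_{\alpha\beta,\gamma\delta} W^{\alpha\beta\gamma\delta}_{,i} = 0$

and may be written in the form (19.15) as

(19.21) $g_{\alpha\beta} U_1^{\alpha\beta} + g_{\alpha\beta,\gamma} V_1^{\alpha\beta\gamma} + g_{\alpha\beta,\gamma\delta} W_1^{\alpha\beta\gamma\delta} = 0$.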


By covariant differentiation of (19.21) and substitution for $g_{ij,p,q}$ and $g_{ij,pq,r}$
from (19.16) and (19.18) we again obtain a system of equations of the form
(19.15). Continuing this process we are led to the following sequence of
systems of equations. As in the case of the linear first integral we add to
this sequence the conditions (19.3) and (19.4) and also the symmetry conditions
on the $g$'s for the purpose of convenient reference:


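(19.22)
$g_{ij,p} + g_{jp,i} + g_{pi,j} = 0$,
$g_{ij,pq} + g_{jp,iq} + g_{pi,jq} = 2 (g_{i\alpha} A^\alpha_{jpq} + g_{j\alpha} A^\alpha_{piq} + g_{p\alpha} A^\alpha_{ijq})$,
$g_{ij} = g_{ji}$, $g_{ij,p} = g_{ji,p}$, $g_{ij,pq} = g_{ji,pq} = g_{ij,qp}$,
$g_{\alpha\beta} U^{\alpha\beta} + g_{\alpha\beta,\gamma} V^{\alpha\beta\gamma} + g_{\alpha\beta,\gamma\delta} W^{\alpha\beta\gamma\delta} = 0$,
$g_{\alpha\beta} U_1^{\alpha\beta} + g_{\alpha\beta,\gamma} V_1^{\alpha\beta\gamma} + g_{\alpha\beta,\gamma\delta} W_1^{\alpha\beta\gamma\delta} = 0$,
. . . . . . . . . .
$g_{\alpha\beta} U_n^{\alpha\beta} + g_{\alpha\beta,\gamma} V_n^{\alpha\beta\gamma} + g_{\alpha\beta,\gamma\delta} W_n^{\alpha\beta\gamma\delta} = 0$,
. . . . . . . . . .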




The algebraic consistency of the equations (19.22) is a necessary condition
for the existence of a homogeneous quadratic first integral. As in the preceding
cases there must therefore exist a number $N$ such that the first $N+1$
systems of equations of (19.22) possess a fundamental set of $s$ solutions $g_{ij}^{(p)}$,
$g_{ijp}^{(p)}$, $g_{ijpq}^{(p)}$ ($p = 1, 2, \ldots, s$) each of which satisfies the $(N+2)$nd system of
the sequence. We shall show that this is also a sufficient condition for the
existence of the quadratic integral (19.1).

We first take the case in which $N = 1$ and $s = 1$. The first two systems
of equations (19.22) then possess a unique solution which satisfies (19.20).
This solution possesses a tensor character so that we may substitute it
in (19.15) and differentiate covariantly, obtaining


Jeg, i Yap of + Gupy,i IJupy J 
(19.23) 


apyo 


+ Iupyo W = 0. 


Jupyeo.i 


Subtracting (19.20), into which the solution $g_{ij}$, $g_{ijp}$, $g_{ijpq}$ has been substituted
in place of $g_{ij}$, $g_{ij,p}$, $g_{ij,pq}$, from (19.23),


(Jeep, i GJapi) v i—Gapyi T Jap + Jaa A3,;) 


(19.24) 
pay 
i 
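The solution appearing in (19.24) satisfies the first system of equations (19.22)
in the summed indices and is consequently given by
(19.25) $g_{ij,p} - g_{ijp} = \varphi_p g_{ij}$,
(19.26) $g_{ijp,q} - g_{ijpq} + g_{\alpha j} A^\alpha_{ipq} + g_{i\alpha} A^\alpha_{jpq} = \varphi_q g_{ijp}$,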
apy 




The covariant vector $\varphi_p$ is the gradient of a scalar function, for if we differentiate
(19.25) covariantly,


$g_{ij,p,q} - g_{ijp,q} = \varphi_{p,q} g_{ij} + \varphi_p g_{ij,q}$.


Hence,


(19.28) $(\varphi_{p,q} - \varphi_{q,p}) g_{ij} = g_{ij,p,q} - g_{ij,q,p} + g_{ijq,p} - g_{ijp,q} + \varphi_q g_{ij,p} - \varphi_p g_{ij,q}$.


If we substitute the values given by (12.4), (19.25), and (19.26) for the
covariant derivatives in (19.28) we find that the right member vanishes identically.
In the left member of (19.28) $g_{ij}$ can not be equal to zero, for if this
were so we see from (19.25) and (19.26) that $g_{ijp}$ and $g_{ijpq}$ would also vanish,
which is contrary to the assumption that our equations are algebraically
consistent. Hence


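$\varphi_{p,q} - \varphi_{q,p} = 0$,
or
(19.29) $\varphi_p = \frac{\partial \varphi}{\partial x^p}$.
A homogeneous quadratic first integral (19.1) will exist if a function $\psi$ can
be chosen so that


(19.30) $(\psi g_{ij})_{,p} = \psi g_{ijp}$,


where the left member denotes the covariant derivative of $\psi g_{ij}$. If we
expand (19.30) we find that the condition takes the form


(19.31) $\frac{\partial \log \psi}{\partial x^p} + \varphi_p = 0$.


Therefore (19.30) is satisfied if we put
$\psi = e^{-\varphi}$.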


Hence, a sufficient condition for the existence of a homogeneous quadratic
first integral (19.1) is that the first two systems of equations (19.22) possess
a unique solution which satisfies the third system.




We return now to the general case and assume that there exists a number $N$
such that the first $(N+1)$ systems of equations (19.22) admit a fundamental
set of $s$ solutions $g_{ij}^{(\alpha)}$, $g_{ijp}^{(\alpha)}$, $g_{ijpq}^{(\alpha)}$ ($\alpha = 1, 2, \ldots, s$) each of which satisfies the
$(N+2)$nd system of equations. The general solution of the first $(N+1)$
systems of equations may then be written


(19.32) $g_{ij} = \varphi^{(\alpha)} g_{ij}^{(\alpha)}$,
(19.33) $g_{ijp} = \varphi^{(\alpha)} g_{ijp}^{(\alpha)}$,
(19.34) $g_{ijpq} = \varphi^{(\alpha)} g_{ijpq}^{(\alpha)}$,


where the right members are summed for $\alpha$ from $\alpha = 1$ to $\alpha = s$. Let us
substitute the particular solution $g_{ij}^{(k)}$, $g_{ijp}^{(k)}$, $g_{ijpq}^{(k)}$ in the equations (19.22) beginning
with the second system and ending with the $(N+2)$nd. If we then
differentiate each system of equations through the $(N+1)$st covariantly, and
subtract it from the system immediately following, we shall obtain the equations


hk k up A (kh o yupy 
(Gop, i Japyi Yo3 AC. vi T Guo A3,;) J 


(k k) (k) upyo 

k (hk o k) o ru py 

(19.35) 
(k) (k) k) (k) a (k) ,upy 
Ja si) Uy i Japyi + Jap Aayi + Jaa A N-1 
(k) (k) (k) pay __ 
T (Yupye, i yoi —1 0. 




Since the solution which appears in (19.35) satisfies the first $(N+1)$ systems
of equations (19.22), it may be written as a linear combination of the
fundamental solutions $g_{ij}^{(\alpha)}$, $g_{ijp}^{(\alpha)}$, $g_{ijpq}^{(\alpha)}$, $\alpha = 1, 2, \ldots, s$. Hence
22 k) — glk) — (ke) 

o (k) o a(ka) (a) 
(19.37) Yiiv.a Aipa + = Yip: 


the left members of these equations being summed for $\alpha$ from $\alpha = 1$ to $\alpha = s$.
To find the conditions which the covariant vectors $\lambda$ must satisfy we proceed
in the same way as for the linear integral and thus obtain the equations


(8) — 4. — ley) — 


3 4 ) (a7) 9(yvB) 
(19.39) (ae?) — + — — 0, 


ijp 


ay) — 


8s 


which are summed for $\beta$ from $\beta = 1$ to $\beta = s$. It follows consequently that


(ap a(ay) (yp 


for otherwise one of the fundamental solutions could be expressed linearly 
in terms of the others. 

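A homogeneous quadratic first integral (19.1) will exist provided that the
arbitrary functions $\varphi^{(\alpha)}$ in the general solution can be so chosen that


(19.41) $(\varphi^{(\alpha)} g_{ij}^{(\alpha)})_{,p} = \varphi^{(\alpha)} g_{ijp}^{(\alpha)}$,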




where the term on the left denotes the covariant derivative of $\varphi^{(\alpha)} g_{ij}^{(\alpha)}$. If we
expand (19.41) we find that the quadratic integral will exist if the $\varphi$'s can be
chosen so as to satisfy the equation


(19.42) $\frac{\partial \varphi^{(\beta)}}{\partial x^p} + \varphi^{(\alpha)} \lambda^{(\alpha\beta)}_p = 0$.

The integrability condition of (19.42) is the equation (19.40) so that a set of
$\varphi$'s can be found which will satisfy (19.42).

Hence, a necessary and sufficient condition for the existence of a quadratic
first integral (19.1) is that the $\Gamma$'s be such that there exists a number $N$ such
that the first $(N+1)$ systems of equations (19.22) admit a fundamental set
of $s$ solutions ($s \geq 1$) which satisfies the $(N+2)$nd system of equations.
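
A concrete quadratic first integral of the form (19.1) can be exhibited in the same simple setting used for the linear case. The sketch below assumes the Euclidean plane in Cartesian coordinates (all $\Gamma$'s zero, straight-line paths, covariant derivatives equal to partial derivatives) and takes for $g_{ij}$ the product $h_i h_j$ with the hypothetical vector $h = (-y, x)$; neither choice comes from the paper. It checks the symmetrized condition $g_{ij,p} + g_{jp,i} + g_{pi,j} = 0$ and the constancy of $g_{\alpha\beta} \, dx^\alpha/ds \, dx^\beta/ds$ along one straight path.

```python
from fractions import Fraction


def g(x, y):
    """Hypothetical symmetric tensor g_ij = h_i h_j with h = (-y, x)."""
    h = (-y, x)
    return [[h[i] * h[j] for j in range(2)] for i in range(2)]


def dg(x, y):
    """d[p][i][j] = g_{ij,p}; central differences are exact because every
    component of g is a polynomial of degree two."""
    out = []
    for ex, ey in ((1, 0), (0, 1)):
        plus, minus = g(x + ex, y + ey), g(x - ex, y - ey)
        out.append([[Fraction(plus[i][j] - minus[i][j], 2) for j in range(2)]
                    for i in range(2)])
    return out


def cyclic_sum(x, y):
    """All components of g_{ij,p} + g_{jp,i} + g_{pi,j} at the point (x, y)."""
    d = dg(x, y)
    return [d[p][i][j] + d[i][j][p] + d[j][p][i]
            for i in range(2) for j in range(2) for p in range(2)]


def quadratic_integral(point, v):
    """Value of g_{ab} v^a v^b at a point of a path with velocity v."""
    gm = g(*point)
    return sum(gm[i][j] * v[i] * v[j] for i in range(2) for j in range(2))


p0, v = (2, 1), (3, -4)
condition = cyclic_sum(4, -3)
values = [quadratic_integral((p0[0] + v[0] * s, p0[1] + v[1] * s), v)
          for s in range(-3, 4)]
```

Here the quadratic integral is the square of a linear one, so its constancy is expected; the identity tensor $g_{ij} = \delta_{ij}$ would serve equally well and gives the squared speed.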

PRINCETON UNIVERSITY, 

PRINCETON, N. J. 


